Quantum Mechanics - Concepts and Applications 


Tarun Biswas 


June 16, 1999 



Copyright 


Copyright ©1990, 1994, 1995, 1998, 1999 by Tamil Biswas, Physics Department, State 
University of New York at New Paltz, New Paltz, New York 12561. 


Copyright Agreement 


This online version of the book may be reproduced freely in small numbers (10 or less per 
individual) as long as it is done in its entirety including the title, the author’s name and 
this copyright page. For larger numbers of copies one must obtain the author’s written 
consent. Copies of this book may not be sold for profit. 


i 



Contents 


1 Mathematical Preliminaries 1 

1.1 The state vectors. 1 

1.2 The inner product. 2 

1.3 Linear operators . 5 

1.4 Eigenstates and eigenvalues. 6 

1.5 The Dirac delta function. 13 

2 The Laws (Postulates) of Quantum Mechanics 16 

2.1 A lesson from classical mechanics. 16 

2.2 The postulates of quantum mechanics. 17 

2.3 Some history of the postulates . 19 

3 Popular Representations 20 

3.1 The position representation. 20 

3.2 The momentum representation. 23 

4 Some Simple Examples 25 

4.1 The Hamiltonian, conserved quantities and expectation value. 25 

4.2 Free particle in one dimension. 30 

4.2.1 Momentum. 31 

ii 
















4.2.2 Energy. 31 

4.2.3 Position. 32 

4.3 The harmonic oscillator. 34 

4.3.1 Solution in position representation. 36 

4.3.2 A representation free solution. 39 

4.4 Landau levels. 42 

5 More One Dimensional Examples 45 

5.1 General characteristics of solutions. 45 

5.1.1 E < V(x) for all £. 46 

5.1.2 Bound states. 47 

5.1.3 Scattering states. 49 

5.2 Some oversimplified examples. 53 

5.2.1 Rectangular potential well (bound states). 55 

5.2.2 Rectangular potential barrier (scattering states) . 58 

6 Numerical Techniques in One Space Dimension 64 

6.1 Finite differences. 65 

6.2 One dimensional scattering. 66 

6.3 One dimensional bound state problems. 72 

6.4 Other techniques. 74 

6.5 Accuracy . 75 

6.6 Speed. 75 

7 Symmetries and Conserved Quantities 78 

7.1 Symmetry groups and their representation. 78 

7.2 Space translation symmetry. 82 

iii 
























7.3 Time translation symmetry. 83 

7.4 Rotation symmetry. 84 

7.4.1 Eigenvalues of angular momentum. 85 

7.4.2 Addition of angular momenta. 89 

7.5 Discrete symmetries. 92 

7.5.1 Space inversion. 92 

7.5.2 Time reversal. 93 

8 Three Dimensional Systems 96 

8.1 General characteristics of bound states. 96 

8.2 Spherically symmetric potentials. 97 

8.3 Angular momentum. 99 

8.4 The two body problem. 100 

8.5 The hydrogen atom (bound states). 102 

8.6 Scattering in three dimensions. 104 

8.6.1 Center of mass frame vs. laboratory frame . 106 

8.6.2 Relation between asymptotic wavefunction and cross section .... 107 

8.7 Scattering due to a spherically symmetric potential. 108 

9 Numerical Techniques in Three Space Dimensions 112 

9.1 Bound states (spherically symmetric potentials). 112 

9.2 Bound states (general potential) . 114 

9.3 Scattering states (spherically symmetric potentials). 116 

9.4 Scattering states (general potential) . 118 

10 Approximation Methods (Bound States) 121 

10.1 Perturbation method (nondegenerate states). 122 


IV 























10.2 Degenerate state perturbation analysis. 126 

10.3 Time dependent perturbation analysis. 128 

10.4 The variational method . 132 

11 Approximation Methods (Scattering States) 136 

11.1 The Green’s function method. 137 

11.2 The scattering matrix. 142 

11.3 The stationary case. 146 

11.4 The Born approximation. 147 

12 Spin and Atomic Spectra 149 

12.1 Degenerate position eigenstates. 150 

12.2 Spin-half particles . 153 

12.3 Spin magnetic moment (Stern-Gerlach experiment). 155 

12.4 Spin-orbit coupling. 158 

12.5 Zeeman effect revisited. 160 

13 Relativistic Quantum Mechanics 162 

13.1 The Klein-Gordon equation. 163 

13.2 The Dirac equation. 166 

13.3 Spin and the Dirac particle. 169 

13.4 Spin-orbit coupling in the Dirac hamiltonian . 170 

13.5 The Dirac hydrogen atom. 172 

13.6 The Dirac particle in a magnetic field . 177 

A ‘C’ Programs for Assorted Problems 180 

A.l Program for the solution of energy eigenvalues for the rectangular potential 

well . 180 


v 






















A.2 General Program for one dimensional scattering off arbitrary barrier .... 181 

A.3 Function for rectangular barrier potential. 182 

A.4 General energy eigenvalue search program. 183 

A.5 Function for the harmonic oscillator potential. 185 

A.6 Function for the hydrogen atom potential. 186 

B Uncertainties and wavepackets 189 


vi 







Preface 


The fundamental idea behind any physical theory is to develop predictive power with a 
minimal set of experimentally tested postulates. However, historical development of a 
theory is not always that systematic. Different theorists and experimentalists approach 
the subject differently and achieve successes in different directions which gives the subject 
a rather “patchy” appearance. This has been particularly true for quantum mechanics. 
However, now that the dust has settled and physicists know quantum mechanics reasonably 
well, it is necessary to consolidate concepts and put together that minimal set of postulates. 

The minimal set of postulates in classical mechanics is already very well known and 
hence it is a much easier subject to present to a student. In quantum mechanics such a set 
is usually not identified in text books which, I believe, is the major cause of fear of the sub¬ 
ject among students. Very often, text books enumerate the postulates but continue to add 
further assumptions while solving individual problems. This is particularly disconcerting in 
quantum mechanics where, physical intuition being nonexistent, assumptions are difficult 
to justify. It is also necessary to separate the postulates from the sophisticated mathe¬ 
matical techniques needed to solve problems. In doing this one may draw analogies from 
classical mechanics where the physical postulate is Newton’s second law and everything 
else is creative mathematics for the purpose of using this law in different circumstances. In 
quantum mechanics the equivalent of Newton’s second law is, of course, the Schrodinger 
equation. However, before using the Schrodinger equation it is necessary to understand 
the mathematical meanings of its components e.g. the wavefunction or the state vector. 
This, of course, is also true for Newton’s law. There one needs to understand the relatively 
simple concept of particle trajectories. 

Some previous texts have successfully separated the mathematics from the physical 
principles. However, as a consequence, they have introduced so much mathematics that the 
physical content of the theory is lost. Such books are better used as references rather than 
textbooks. The present text will attempt a compromise. It will maintain the separation 
of the minimal set of postulates from the mathematical techniques. At the same time 
close contact with experiment will be maintained to avoid alienating the physics student. 
Mathematical rigor will also be maintained barring some exceptions where it would take 
the reader too far afield into mathematics. 



A significantly different feature of this book is the highlighting of numerical methods. 
An unavoidable consequence of doing practical physics is that most realistic problems do not 
have analytical solutions. The traditional approach to such problems has been a process 
of approximation of the complex system to a simple one and then adding appropriate 
numbers of correction terms. This has given rise to several methods of finding correction 
terms and some of them will be discussed in this text. However, these techniques were 
originally meant for hand computation. With the advent of present day computers more 
direct approaches to solving complex problems are available. Hence, besides learning to 
solve standard analytically solvable problems, the student needs to learn general numerical 
techniques that would allow one to solve any problem that has a solution. This would 
serve two purposes. First, it makes the student confident that every well defined problem is 
solvable and the world does not have to be made up of close approximations of the harmonic 
oscillator and the hydrogen atom. Second, one very often comes up with a problem that is 
so far from analytically solvable problems that standard approximation methods would not 
be reliable. This has been my motivation in including two chapters on numerical techniques 
and encouraging the student to use such techniques at every opportunity. The goal of these 
chapters is not to provide the most accurate algorithms or to give a complete discussion 
of all numerical techniques known (the list would be too long even if I were to know them 
all). Instead, I discuss the intuitively obvious techniques and encourage students to develop 
their own tailor-made recipes for specific problems. 

This book has been designed for a first course (two semesters) in quantum mechanics 
at the graduate level. The student is expected to be familiar with the physical principles 
behind basic ideas like the Planck hypothesis and the de Broglie hypothesis. He (or she) 
would also need the background of a graduate level course in classical mechanics and some 
working knowledge of linear algebra and differential equations. 
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Chapter 1 


Mathematical Preliminaries 


1.1 The state vectors 


In the next chapter we shall consider the complete descriptor of a system to be its state 
vector. Here I shall define the state vector through its properties. Some properties and 
definitions that are too obvious will be omitted. I shall use a slightly modified version of 
the convenient notation given by Dirac [1]. A state vector might also be called a state or a 
vector for short. In the following, the reader is encouraged to see analogies from complex 
matrix algebra. 

A state vector for some state s can be represented by the so called ket. vector |s). The 
label s can be chosen conveniently for specific problems. |s) will in general depend on all 
degrees of freedom of the system as well as time. The space of all possible kets for a system 
will be called the linear vector space V. In the following, the term linear will be dropped as 
all vector spaces considered here will be linear. The fundamental property (or rule) of V is 

Rule 1 If |s), |r) € V then 

a|s) + b\r) € V, 

where a,b E C (set of complex numbers) 

The meaning of addition of kets and multiplication by complex numbers will become obvious 
in the sense of components of the vector once components are defined. The physical content 
of the state vector is purely in its “direction”, that is 

Rule 2 The physical contents of |s) and a|s) are the same if a £ C and a ^ 0. 

At this stage the following commonly used terms can be defined. 


1 
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Definition 1 A LINEAR COMBINATION of state vectors is a sum of several vectors 
weighted by complex numbers e.g. 

a\p) + b\q) + c\r) + d\s) + ... 

where a, b,c,d € C. 

Definition 2 A set of state vectors is called LINEARLY INDEPENDENT if no one mem¬ 
ber of the set can be written as a linear combination of the others. 

Definition 3 A subset U of linearly independent state vectors is called COMPLETE if any 
|s) € V can be written as a linear combination of members ofU. 

1.2 The inner product 

The inner product is defined as a mapping of an ordered pair of vectors onto C, that is, the 
inner product is a complex number associated to an ordered pair of state vectors. It can 
be denoted as (|r), |s)) for the two states |r) and |s). The following property of the inner 
product is sometimes called sesquilinearity. 

Rule 3 

(o|r) + b\u), c|s) + d\v)) = 

o*c(|r), |s» + b*c(\u), |s)) + a*d(\r), |u)) + b*d(\u), |v)). 


This indicates that the inner product is linear in the right argument in the usual sense but 
antilinear in the left argument. The meaning of antilinearity is obvious from rule 3. For 
compactness of notation one defines the following. 

Definition 4 is called the adjoint of V. For every member |s) € V there is a cor¬ 
responding member |s)t € and vice versa. The (s| (bra of s) notation is chosen such 
that 

l^) 1 = (4 = I s) 

The one-to-one correspondence ofV andV t is specified as follows through the corresponding 
members |r) and (r\. 

|r)t|s) = (r||s) = (r\s) = (|r), |s)) (1.1) 

where |s) is an arbitrary ket. 
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The names “bra” and “ket” are chosen because together they form the “bracket” of the 
inner product. From rule 3 and definition 4 it can be seen that 

(a\r) + b[u)y = a*(r\ + b*(u\, (1.2) 

(a(r\ + 6(rt|)^ = a*\r) + b*\u). (1.3) 

Using this new notation, rule 3 can now be written as 

Rule 3 


(a| r) + 6|«)) t (c|s) + d\v}) 

= (a*(r| + 6*(«|)(c|s) + d\v)) 

= a*c(r\s) + b*c(u\s) + a*d(r\v) + b*d(u\v). 


Another property of the inner product that is necessary for our applications is 

Rule 4 


( r \ s )* = < r l s > f = \s)Hr\^ = (s|r). 


At this stage it might have occurred to the student that state vectors are a generalization 
of vectors in arbitrary dimensions. In fact they will be seen to be of infinite dimensionality 
in most cases. The kets are like column vectors and the bras like row vectors of complex 
matrix algebra. The inner product is the equivalent of the scalar or dot product. 

Extending the analogy one can define orthogonality, and norm. 

Definition 5 Two nonzero vectors represented by the kets |r) and |s) are defined to be 
ORTHOGONAL if (r\s) = 0. 


Definition 6 The NORM of a vector |s) is defined as its inner product with itself viz. 
(s|s). Note that, for convenience, this is chosen to be the square of the usual definition of 
the norm. 


From rule 4 it is obvious that the norm of any vector must be real. Another rule that one 
needs can now be introduced. 

Rule 5 The norm of every vector in V is positive definite except for the zero vector (the 
additive identity) which has a norm of zero. 
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Now one can prove two useful theorems relating orthogonality and linear independence of 
a set of vectors. 

Theorem 1.1 A set of mutually orthogonal nonzero vectors is linearly indepejident. 


Proof: Let the set of mutually orthogonal vectors be {|/*)} where the label i distinguishes 
different members of the set. Here I shall choose i to be a positive integer. But the 
proof presented here can be readily generalized for i belonging to any set of integers 
or even a continuous set of real numbers. 

We shall prove the theorem by contradiction. Hence, let us assume that the set is 
not linearly independent i.e. some member | /*.) of the set can be written as a linear 
combination of the others. Then 


\fk) = ( L4 ) 

i^k 

Multiplying ( i.e. taking an inner product) from the left by (fj\ (j ^ k), one obtains 

ifjlfk) = (1-5) 

i^k 

From the mutual orthogonality condition the left side vanishes and the right side has 
only one term remaining i.e. 

0 = a j(fj\fj)- (!-6) 

From rule 5 we conclude that (fj\fj) cannot be zero and hence 

H = 0 Vj. (1.7) 

This leads to the right side of equation 1.4 being zero. But the vector | /&) is not zero. 
This contradiction completes the proof. 


Theorem 1.2 Members of a set of n linearly independent nonzero vectors can be written 
as a linear combination of a (nonunique) set of n mutually orthogonal nonzero vectors. 

Proof: Let the given set of linearly independent vectors be {| <7*)}- For convenience the label 
i can be considered to be a positive integer (i = 1, 2,..., n). However, a generalization 
for i belonging to any set of integers or even a continuous set of real numbers is 
possible. 

We shall prove this theorem by construction. Let us define a set of vectors {\fi}} 
(i = 1,2,... ,n) by 

ia> = « - E yllfli/i). 

j =1 \Jl\Jl/ 


( 1 . 8 ) 
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This set can be seen to be a mutually orthogonal set (by induction). If the |g fc )’s are 
linearly independent then all the |/fc)’s can be shown to be nonzero. Also it is evident 
from equation 1.8 that the \gQ : s can be written as a linear combination of the |/j.)’s. 
This completes the proof. 

Definition 7 A linear transformation from a linearly independent nonzero set {| <7*)} to a 
mutually orthogonal nonzero set {|/i)} is called ORTHOGONALIZATION. This is not a 
unique transformation and the one shown in equation 1.8 is just an example. 


1.3 Linear operators 


An operator defined on the space V is an object that maps the space V onto itself. If Q is 
an operator then its operation on a ket |s) is written as Q|s) and Q|s) £ V. An operator Q 
is a linear operator if 

Rule 6 


Q(a\r) + 6|s)) = aQ\r) + bQ\s ), 
where a,b £ C and |r), |s) £ V. 

The addition of two operators and multiplication by a complex number is defined by the 
following. 

Definition 8 

(aP + bQ)\s) = a(P\s)) + b(Q\s)), (1.9) 

where a,b £ C, |s) £ V and P and Q are linear operators (to be called just operators from 
here on as nonlinear operators will never be used). 

Product of two operators P and Q is defined to be PQ in an obvious way. 

Definition 9 

(PQ)\s) = P(Q\s)), (1.10) 


where |s) £ V. 

In general PQ QP. Hence, we define: 


Definition 10 The COMMUTATOR BRACKET (or just COMMUTATOR) of two oper¬ 
ators P and Q is defined as 


[P, Q\ = PQ - QP 


(1.11) 
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The following identities involving commutators can be readily proved from the above defi¬ 
nition. 


[P,Q] 

= ~[Q,P], 

(1.12) 

[P, Q + R] 

= [P,Q] + [P,R], 

(1.13) 

[P,QR] 

= [. P,Q}R + Q[P,R}, 

(1.14) 

[P,[Q,R}} + [R, [P,Q]] + [Q,[i?,P]] 

= 0 . 

(1.15) 


These are the same as the properties of the Poisson bracket in classical mechanics. Postu¬ 
late 2 in the next chapter uses this fact. 

Operation of an operator Q on a bra (s| is written as (s|Q and is defined as follows. 

Definition 11 

((s-|Q)|r) = {s\Q\r) = (s|(Q|r)) (1.16) 

where |r) € V. 

Another useful definition is: 

Definition 12 The adjoint of an operator Q is called and defined as 

QHs) = ((s\Qfi (1.17) 


where |s) € V. 

For the description of observables the following kind of operators will be needed. 

Definition 13 An operator H is said to be HERMITIAN (or SELF ADJOINT) if 

W = H (1.18) 

1.4 Eigenstates and eigenvalues 


Definition 14 If for some operator Q, there exists a state | q) and a complex number q 
such that 


Q\q ) = <?!<?>, 


(1.19) 


then q is called an EIGENVALUE of Q and \q) the corresponding EIGENSTATE. 


It is in general possible for more than one eigenstate to have the same eigenvalue. 
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Definition 15 When n{> 1) linearly independent eigenstates have the same eigenvalue, 
they are said to be (n-FOLD) DEGENERATE. 


For our purposes the eigenvalues and eigenstates of hermitian operators are of particular 
interest. If H is a hermitian operator, some useful theorems can be proved for its eigenstates 
and corresponding eigenvalues. 


Theorem 1.3 All eigenvalues of a hermitian operator H are real. 


Proof: If \h) is the eigenstate corresponding to the eigenvalue h then 

H\h) = h\h) (1.20) 

The adjoint of this relation is (see problem 4) 

(h\H* = h*(h\. 

As H is hermitian this is the same as 

(h\H = h*(h\. (1.21) 

Multiplying (that is taking the inner product) equation 1.20 from the left by (h\ one 
gets 

(h\H\h) = h(h\h). (1.22) 

Multiplying equation 1.21 from the right by \h) one gets 

(h\H\h) = h*(h\h). (1.23) 


Hence, barring the trivial case of \h) being the zero vector, equations 1.22 and 1.23 
lead to 


h = h*. 


(1.24) 


This completes the proof. 


Theorem 1.4 Eigenstates \h\) and \h 2 ) of a hermitian operator H are orthogonal (i.e. 
(h\ \h 2 ) = 0) if the corresponding eigenvalues hi and h 2 are not equal. 


Proof: By definition 


H\hi) 
H\h 2 ) 


hi\hi), 

h 2 \h 2 ). 


(1-25) 

(1.26) 
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As H is hermitian, using theorem 1.3, the adjoint of equation 1.25 is seen to be 


(hi\H = hi(hi\. 


Multiplying equation 1.26 from the left by (h\\ one gets 

(hi\H\h 2 ) = h 2 (^IM- 

Multiplying equation 1.27 from the right by \h 2 ) one gets 

(h l \H\h 2 ) = h l {h l \h 2 ). 

Subtracting equation 1.28 from equation 1.29 gives 


{hi - h 2 ){h 1 \h 2 ) = 0 . 


As hi 7 ^ h 2 this means 


This completes the proof. 


(hi\h 2 ) = 0 . 


(1.27) 

(1.28) 

(1.29) 

(1.30) 

(1.31) 


Corollary 1.1 From theorem 1.2 it can be shown that the orthogonalization of a set of 
n-fold degenerate eigenstates produces a set of mutually orthogonal n-fold degenerate eigen¬ 
states with the same common eigenvalue. 


Corollary 1.2 From theorem l.f and corollary 1.1, one can readily see that any set of 
linearly independent eigenstates of a hermitian operator can be linearly transformed (only 
the degenerate eigenstates need be transformed) to a set of mutually orthogonal eigenstates 
with the same eigenvalues. 


Definition 16 A set of eigenvalues is called DISCRETE if it has a one to one correspon¬ 
dence with some subset of the set of integers and any real number between two successive 
members of the set is not an eigenvalue. 


Definition 17 A set of eigenvalues is called CONTINUOUS if it has a one to one corre¬ 
spondence with the set of points on a segment of the real line. 


Hence, for a discrete set of eigenvalues (of a hermitian operator) the eigenstates can be 
labelled by integers and chosen such that 


(hj | hj ) v..j bjj 
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where i and j are integers, | hi) and | hj ) are eigenstates and S, t j is the Kronecker delta 
(equation 1.61 gives a definition). Rule 2 can be used to choose n % , the norm of the i-th 
eigenstate, to be unity. With this choice we obtain 

{h i \h j )=6 ij , (1.32) 

where i and j are integers. For continuous eigenvalues one cannot use equation 1.32 as the 
eigenstates cannot be labelled by integers. They will have real numbers as labels. It is 
very often convenient to use the eigenvalue itself as a label (unless there is a degeneracy). 
Hence, for continuous eigenvalues one writes the equivalent of equation 1.32 as its limiting 
case of successive eigenvalues getting indefinitely close. In such a limit the Kronecker delta 
becomes the Dirac delta function (see equation 1.63 for a definition). So, once again, using 
rule 2 suitably one gets 

{h\ti) = 6{h - h') (1.33) 

where \h) and \h') are the eigenstates with real number labels h and h' and 8{h — h') is the 
Dirac delta function. Note that in this case the norm of an eigenstate is infinite. 

Definition 18 The choice of suitable multipliers for the eigenstates (using rule 2) such that 
the right sides of equations 1.32 and 1.33 have only delta functions, is called NORMAL¬ 
IZATION and the corresponding mutually orthogonal eigenstates are called NORMALIZED 
EIGENSTATES or ORTHONORMAL EIGENSTATES. From here on, all eigenstates of 
hermitian operators will be assumed to be normalized according to equation 1.32 or equa¬ 
tion 1.33. However, very often for brevity equation 1.32 might be used symbolically to 
represent both cases. As these are mutually exclusive cases there would be no confusion. 

The completeness definition of section 1.1 can now be written in terms of discrete and 
continuous labels. 

Definition 19 A set of states with label i is said to be COMPLETE if any |s) 6 V 

can be written as a linear combination of the |/i*) i.e. 

\s) =Y^ a i\hi) (1-34) 

i 

where a* are complex coefficients. For continuous eigenvalues the above summation is to be 
understood to be its obvious limit of an integral over the continuous label (or labels). 

|s) = J a(h)\h)dh (1.35) 

where a(h) is a complex function of the label h. 

Now one can state and prove the completeness theorem for the eigenstates of a hermitian 
operator. The proof presented here is not for the most general case. However, it illustrates 
a method that can be generalized. In a first reading this proof may be omitted. 
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Theorem 1.5 An orthonormal (not necessary but convenient) set of all linearly indepen¬ 
dent eigenstates of a hermitian operator is complete. 


Proof: Let the hermitian operator be H and let the orthonormal set of all linearly inde¬ 
pendent eigenstates of H be with i as the label. For convenience, the label will 

be chosen to be discrete (i = 1,2,...). However, the proof can be readily extended 
for other discrete sets of labels as well as continuous labels. 

The theorem will be proved by contradiction. Hence, it is assumed that the set 
be not complete. From theorem 1.2 it then follows that there exists a complemen¬ 
tary set of orthonormal states that together with {|/tj)} will form a complete 

orthonormal set. This would mean that all |gj)’s are orthogonal to all |/ij)’s. The 
operation of H on \gf) can then be written as a linear combination of the complete 
set: 

H\9i) = Y dij\gj) + 51 b ij\hj). (1.36) 

3 3 

Multiplying from the left by (h^ | one gets 

(h k \H\ 9i ) = b ik , (1.37) 

where use is made of the orthonormality of the |</j)’s and |/ij)’s. As (hk\ is the bra 
adjoint to the eigenket \hk) with eigenvalue and H is hermitian, 

(h k \H = h k (h k \. (1.38) 

Using this in equation 1.37 one gets (using orthonormality) 

bik = h k {h k \gi) = 0. (1-39) 

Hence, equation 1.36 becomes 

H \9i) = Y a v\9j)- (i- 40 ) 

3 

Now let us consider the set V c of all states that are linear combinations of the \gi)’s 
i.e. 

\k) € V c 4=>- \k) = Y°i\9i), (1-41) 

i 

for some set of complex numbers Cj. It can be readily shown (problem 4) that 
(k\H\k)/(k\k) is a real number and hence would have some minimum value for all 
| k) € V c . If e is this minimum value 1 then for any | k) £ V c 

(k\H\k)/(k\k) > e. (1.42) 

1 If e = — oo one needs to be more careful, but the proof of the theorem still holds in an appropriate 
limiting sense. To be rigorous, one also needs to consider the possibility that the range of (k\H\k)/(k\k) for 
all |fc) is an open set. Then equation 1.42 does not have the possibility of equality. Here again a limiting 
choice is to be made for |gi) such that (an — e) —> 0. 
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Without loss of generality the first of the set {| gt)}, viz. |< 7 i), could be chosen to be 
the one for which equation 1.42 becomes an equality (theorem 1.2) i.e. 

(fl'il^lS'i) = e, (1-43) 

where it is noted that {gi\gi) = 1 from orthonormalization. Also from equations 1.40 
and 1.43 one sees that 

an = e. (1-44) 

If | k) £ V c then from equations 1.40, 1.41 and 1.42 one obta in s 

Y^CiCjCiij > e^|cj| 2 . (1.45) 

ij i 

As the Cj’s are arbitrary, one may choose them all to be zero except 

ci = l,c m = e + i6, (1.46) 

where e and 6 are real and m/1. Then from equations 1.45 and 1.44 it follows that 

^(®mi T ai m ) T i<5(a m i aim) T (e T 6 )(a mm ®) — 0. (1.47) 

For small enough e and 6, it can be seen that the last term on the left hand side will 
contribute negligibly and hence, the inequality can be violated with suitable choices 
for the signs of e and 6, unless 


®ml T aim — o, a m l ®lm — 0. 


This gives 


aim — ®ml — O' 


This being true for any m/1, one concludes from equation 1.40 that 


H\gi) = anlsi). 


(1.48) 

(1.49) 

(1.50) 


This, of course, means that | < 71 ) is an eigenstate of H thus contradicting the original 
statement that the | < 7 *)’s are not eigenstates of H. Hence, the set {|< 7 i)} must be 
empty and the set {|/ij)} must be complete. This completes the proof. 


From the completeness theorem 1.5, we see that if is a set of all ortho normal eigen¬ 

states of H then any state |s) can be written as 

\s) = Yl c si\hi). (1.51) 

i 


Definition 20 The coefficient c S i in equation 1.51 is called the COMPONENT of\s) along 
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Multiplying equation 1.51 from the left by (hj\ and using orthonormality one obtains 


c s j — (hj | s). 

Replacing this in equation 1.51 we get 

I s ) = X^)(^l s >- 


Symbolically this can be written as 


I s ) = l s >> 


(1.52) 

(1.53) 

(1.54) 


giving the object in parenthesis the meaning of an operator in an obvious sense. But this 
operator operated on any state produces the same state. Hence, it is the identity operator 


I = J2\hi)(hi 


(1.55) 


Equation 1.55 can be seen to be a compact mathematical statement of the completeness of 
the eigenstates {|/i*)}. 

Very often it is useful to define the projection operators corresponding to each |/ij). 


Definition 21 The projection operator for \ hi) is defined to be 

Pi = \hi)(hi\ 

which selects out the part of the vector |s) in the “direction” \hi). 

| ^) c s i\hf). 


Also from equations 1.55 and 1.56 


i = E p i- 


(1.56) 

(1.57) 

(1.58) 


We shall sometimes use equations 1.55 and 1.58 symbolically in the same form for continuous 
eigenvalues as well. However, it should be understood to mean 


1=1 \h)(h\dh 


(1.59) 


for the real valued label h. In the same spirit equation 1.53 will also be used for continuous 
eigenvalues and would be interpretted as 


I s ) = / \h)(h\s)dh. 


(1.60) 


In fact in future chapters, as a general rule, a summation over indices of a complete set of 
eigenvalues will be understood to be an integral over eigenvalues for continuous eigenvalues. 
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1.5 The Dirac delta function 


The Kronecker delta is usually defined as 


Definition 22 


where i and j are integers. 


$ij 


1 ifi = j, 

0 


(1.61) 


However, the following equivalent definition is found to be useful for the consideration of a 
continuous index analog of the Kronecker delta. 

>>.;/, /- (1-62) 

j 

where i and j are integers and f t represents the i-th member of an arbitrary sequence of 
finite numbers. 

The Dirac delta is an analog of the Kronecker delta with continuous indices. For 
continuous indices the i and j can be replaced by real numbers x and y and the Dirac delta 
is written as 6(x — y). The difference (x — y) is used as the argument because the function 
can be seen to depend only on it. Likewise fi is replaced by a function f(x) of one real 
variable. f(x) must be finite for all x. Hence, the continuous label analog of equation 1.62 
produces the following definition of the Dirac delta function. 


Definition 23 

J 8(x - y)f(y)dy = f(x), (1.63) 

where f(x) is finite everywhere. An integral with no limits shown explicitly is understood 
to have the limits — oo to +oo. 


From this definition it is seen that, f(x) being an arbitrary function, the only way equa¬ 
tion 1.63 is possible is if 6(x — y) is zero everywhere except at x = y. At x = y, S(x — y) 
would have to be infinite as cly is infinitesimal. Hence, the following are true for the Dirac 
delta function. 


5(0) = oo, (1.64) 

6(x) = 0 if x 7 ^ 0. (1.65) 

Because of the infinity in equation 1.64, the Dirac delta has meaning only when multiplied 
by a finite function and integrated. Some identities involving the Dirac delta (in the same 
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integrated sense) that can be deduced from the defining equation 1.63 are 


S(x) 

= *>(-x), 

(1.66) 

j6(x)dx 

= 1, 

(1.67) 

x6(x) 

= o, 

(1.68) 

6(ax ) 

= —Six) for a > 0, 
a 

(1.69) 

6(x 2 — a 2 ) 

= — [6{x — a) + 6(x + a)} for a > 0, 

(1.70) 

x)6{x — b)dx 

= 6(a — b), 

(1.71) 

f(x)6(x - a ) 

= f(a)6(x — a). 

(1.72) 


The derivatives of a Dirac delta can be defined once again in the sense of an integral. I 
shall consider only the first derivative 6'(x). 

j S\x - y)f(y)dy = —f(y)6(x - y) |+~ + j 6(x - y)f(y)dy, (1.73) 

where a prime denotes a derivative with respect to the whole argument of the function. 
Thus 

J s'(x - y)f(y)dy = f{x). (1.74) 

Some identities involving the S'(x) can be derived in the same fashion. 

S'(x) = —S'(—x), (1.75) 

x6'(x) = —6(x). (1-76) 


To understand the Dirac delta better it is very often written as the limit of some better 
known function. For example, 


8(x) 

= lim 

g^oo 

sin (gx) 

irx 

S(x) 

1 ( X 2 

= Inn — — exp- — 

a—>0 ay/TT \ a z 

S(x) 

- hi 

r 

exp (ikx)dk. 


(1.77) 

(1.78) 

(1.79) 


Problems 

1. The norm (s|s) of a vector |.s) is sometimes written as ||s)| 2 . In this chapter the norm 
has been defined from the inner product. However, it is possible to first define the 
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norm and then the inner product as its consequence. Such an approach needs fewer 
rules but is more unwieldy. The inner product is then defined as: 

(r\s) = ^[||r) + |s)| 2 *-i||r) + i|s)| 2 
+ (i- 1 )(||r)| 2 + ||s)| 2 )]. 

Prove this result using the definition of inner product and norm as given in this 
chapter. 

2. In equation 1.8, show that a linearly dependent set would give some of the |/j)’s 

to be the zero vector. 

3. Using the defining equation 1.11 of the commutators prove the identities in equa¬ 
tions 1.12 through 1.15. 

4. Prove the following operator relations (for all operators P and Q , |s) £ V, and a, b € C) 

(a) (Q|s»t = (s|Qt 

(b) = Q 

(c) (aP + bQ )t = a*Pt + b*Q t 

(d) (PQ )t = Qtpt 

(e) PQ is hermitian if P and Q are hermitian and [P, Q] = 0. 

(f) For a hermitian operator H and |s) £ V, (s|P|s) is real. 

5. Prove the corollary 1.1. 

6. Using the defining equation 1.63 of the Dirac delta, prove the identities in equa¬ 
tions 1.66 through 1.72. For the derivative of the Dirac delta prove the identities in 
equations 1.75 and 1.76. [Hint: Remember that these identities have meaning only 
when multiplied by a finite function and integrated.] 



Chapter 2 


The Laws (Postulates) of Quantum 
Mechanics 

In the following, the term postulate will have its mathematical meaning i.e. an assumption 
used to build a theory. A law is a postulate that has been experimentally tested. All 
postulates introduced here have the status of laws. 

2.1 A lesson from classical mechanics 

There is a fundamental difference in the theoretical structures of classical and quantum 
mechanics. To understand this difference, one first needs to consider the structure of 
classical mechanics independent of the actual theory given by Newton. It is as follows. 

1. The fundamental measured quantity (or the descriptor) of a sytem is its trajectory in 
configuration space (the space of all independent position coordinates describing the 
system). The configuration space has dimensionality equal to the number of degrees 
of freedom (say n) of the system. So the trajectory is a curve in n dimensional space 
parametrized by time. If Xi is the i-th coordinate, then the trajectory is completely 
specified by the n functions of time xfit). These functions are all observable. 

2. A predictive theory of classical mechanics consists of equations that describe some 
initial value problem. These equations enable us to determine the complete trajectory 
Xi(t) from data at some initial time. The Newtonian theory requires the Xi and their 
time derivatives as initial data. 

3. The Xi(t) can then be used to determine other observables (sometimes conserved 
quantities) like energy, angular momentum etc.. Sometimes the equations of motion 
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can be used directly to find such quantities of interest. 


The above structure is based on the nature of classical measurements. However, 
at small enough scales, such classical measurements (like the trajectory) are found to be 
experimentally meaningless. Thus, a different theoretical structure becomes necessary. This 
structure is that of quantum mechanics. The structure of quantum mechanics, along with 
the associated postulates, will be stated in the following section. It is itemized to bring out 
the parallels with classical mechanics. 

The reader must be warned that without prior experience in quantum physics the 
postulates presented here might seem rather ad hoc and “unphysical”. But one must be 
reminded that in a first course in classical physics, Newton’s laws of motion might seem 
just as ad hoc. Later, a short historical background will be given to partially correct this 
situation. About the “unphysical” nature of these postulates, very little can be done. Phe¬ 
nomena like the falling of objects due to gravity are considered “physical” due to our long 
term exposure to their repeated occurrence around us. In contrast, most of the direct 
evidence of quantum physics is found at a scale much smaller than everyday human experi¬ 
ence. This makes quantum phenomena inherently “unphysical”. Hence, all one can expect 
of the following postulates is their self consistency and their ability to explain all observed 
phenomena within their range of applicabilty. To make quantum phenomena appear as 
“physical” as classical phenomena, one needs to repeatedly experience quantum aspects of 
nature. Hence, this text (like most others) tries to provide as many examples as possible. 

At first sight, the reader might also find the postulates to be too abstract and com¬ 
putationally intractable. The next two chapters should go a long way in correcting this 
problem. 


2.2 The postulates of quantum mechanics 


In the following, the postulates of quantum mechanics are presented within a theoretical 
structure that has a flavor similar to classical mechanics. The reader is encouraged to 
observe similarities and differences of the two theories. 

1. The descriptor is given by the zeroth postulate. Its relation to measurements is 
somewhat indirect (see postulates 2 through 5). 

Postulate 0 The complete descriptor (hut not a measured quantity) of a system is its 
state vector |s) and the complete descriptor of an observable q is a hermitian operator 
Q defined to operate on any |s) € V. \s), in general, depends on as many variables 
as there are degrees of freedom and time. 
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2. Predictive power comes from the initial value problem described by the following 
somewhat generalized Schrodinger equation. 

Postulate 1 

in §i^ = l 2 - 1 ) 

where H is the hamiltonian operator obtained by replacing the classical position and 
momentum in the classical expression of the hamiltonian by their quantum operator 
analogs, h (m 1.0545 x 10 -34 Joide.sec.) is a universal constant. 2irh(= h ) is usually 
called Planck’s constant. 


3. Quantum measurements of observables are conceptually distinctly different from clas¬ 
sical measurements. Classical measurements can be theoretically predicted with indef¬ 
inite accuracy (but the theory fails completely at smaller scales) . Quantum mechanics 
can predict only the probabilities of measurements (at all scales but would very often 
be impractical at larger scales). Every observable has an associated operator that 
operates on the space of state vectors V. 


Postulate 2 All measurable aspects of observables are completely determined by the 
mutual commutators of their respective operators. These commutators are determined 
by the following transition from classical to quantum. 


{q,p} 


[Q,P] 

ih 


( 2 . 2 ) 


where q and p are classical observables with quantum operator analogs given by Q and 
P respectively and {, } is the Poisson bracket. 


Postulate 3 The possible results of measurement of an observable, represented by 
the operator Q, are its eigenvalues q $ only. 


Postulate 4 If a system is in a state |s) and a measurement of an observable rep¬ 
resented by the operator Q is made on it, the probability that the result will be the 
eigenvalue qi is proportional to 

£(ft|s}(s|ft) = 53Kft|s>| 2 (2.3) 

deg deg 

where \ qf) is an eigenstate corresponding to q t and the summation is over all degen¬ 
erate states with the same eigenvalue q t . The proportionality constant can be chosen 
arbitrarily for computational convenience. It is very often chosen to keep the total 
probability of all outcomes as unity. 


Postulate 5 If the result of the measurement is indeed q L , then after the measurement 
the system will collapse into a corresponding eigenstate \qf). 
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This completes the set of postulates necessary in a theory of quantum mechanics. To 
understand the theory we need to use these postulates in physical examples. The rest of 
the book will be seen to be creative applications of mathematics to do just this. 


2.3 Some history of the postulates 


At the end of the nineteenth century one of the major experimental results that baffled 
classical physicists was the blackbody radiation spectrum. Classical physics had been highly 
successful in explaining and predicting a large variety of phenomena but for the radiation 
spectrum of a blackbody it gave the absurd result of infinite emission at infinite frequency 
(sometimes called the “ultraviolet catastrophe”). Planck was able to suitably “explain” the 
experimentally observed spectrum by a hitherto arbitrary assumption that the energies of 
oscillators producing radiation of frequency v can have energies only in integer multiples 
of hu. This was the origin of the constant h. At the same time discreteness was also 
noticed in the frequency spectra of atoms. Such observations led to the hypothesis (by 
de Broglie) that just as Planck had noticed that electromagnetic waves have a discrete 
(particle) nature, particles (like elctrons) have a wave nature. The wavelength A of a particle 
of momentum p is given by h/p. This led Schrodinger to the equation in postulate 1 in a 
particular representation that will be discussed as the position representation in the next 
chapter. The “wavefunction” of Schrodinger’s equation is the equivalent of the state vector 
(postulate 0). The generalized properties of the state vector (linearity etc.) have their 
origin in the wavefunction. The state vector was later chosen as the descriptor to allow 
greater generality, mathematical convenience, and economy in concepts. 

The postulates 2 through 5 were discovered in the process of consolidating experimen¬ 
tal observations with a theory of wavefunctions (or state vectors). 

After this rather short and oversimplified narration of the history of quantum physics, 
in general, the historical approach will be avoided in this text. This is not to downplay the 
role of history, but to avoid many of the confusions that arose in the historical development 
of the subject. As we now have the advantage of twenty-twenty hindsight, we shall use it. 



Chapter 3 


Popular Representations 


To get numerical results from the laws discussed in the previous chapter it is very often 
convenient to use some less abstract “representations” of state vectors and operators. 

We first note (from postulate 0) that operators corresponding to observables are her- 
rnitian and therefore have real eigenvalues (theorem 1.3). Hence, from postulate 3, we see 
that measured quantities will be real as expected. It was shown in theorem 1.5 that the 
eigenstates of a hermitian operator form a complete orthonormal set. This makes it natural 
to expand any state as a linear combination of the eigenstates of an operator corresponding 
to an observable. For example, if Q is such an operator with | qf) as its eigenstates, then an 
arbitrary state |s) can be written as 

(3.1) 

i 

where the components c s , are given by (definition 20 on page 11) 

c S i = (<?*|s)- (3.2) 

Definition 24 The set of components, c S i, of |s) along the eigenstates of the operator Q 
completely describes the state |s) and hence, will be called the Q REPRESENTATION of 
I s). 

Two popular representations are discussed below. 


3.1 The position representation 


The most popular choice of representation is that of the position vector operator R for a 
single particle system. The eigenvalues of R are known to be continuous as every value of 
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position is measurable. The corresponding eigenstates are assumed to be nondegenerate 
for now 1 . Hence, they can be uniquely labeled by the eigenvalues r i.e. |r). Then, from 
equation 3.2, the position representation of the state |s) could be written as the components 

(r|s) = T s (r). (3.3) 

So this set of components can now be seen as values of a function at different positions 
r. This function, T s (r), is conventionally known as the wavefunction because of its wave 
nature in some typical situations. Historically, this representation of the state has been 
the most popular for two reasons. First, the wave nature of this representation had early 
experimental consequences. Second, it will be seen to reduce most problems to differential 
equations. The mathematics of differential equations, including methods of approximation, 
is very well known and makes problem solving easier. 

The wavefunction T s (r) can be seen from another point of view. Applying postulate 4 
for the position operator, one sees that the probability of finding the particle at the position 
r is proportional to 

(r|s)(s|r) = T s (r)4/*(r). (3.4) 

This is usually known as the probability density which when integrated over a finite volume 
gives the probability of finding the particle in that volume. 

We shall now derive the forms of the position and momentum operators and their 
eigenstates in the position representation just defined. We shall do this in one space di¬ 
mension. Extension to three dimensions is straightforward (problem 1). 

The eigenstates of X, the position operator in one dimension, are |x) with correspond¬ 
ing eigenvalues x i.e. 

X\x) = x\x). (3-5) 

For an arbitrary state |s) the result of operation by X is .Xjs). Its position representation 
is (x|X|s). Using equations 3.3 and 3.5 and the hermiticity of X one gets 

(x|X|s) = a:(a:|s) = xT s (x). (3.6) 

This shows that in the position representation the effect of operating by X is just multipli¬ 
cation of the wavefunction by the corresponding eigenvalue x. 

To find the position representation of the momentum operator we note that the rep¬ 
resentations of the operators X (position) and P (momentum) must satisfy postulate 2. 
Hence, as their classical Poisson bracket is 1, we may write 

[X, P] = XP - PX = ih, (3.7) 

X A degeneracy would mean that there are degrees of freedom other than just position. Such internal 
degrees of freedom have no classical analog and can be ignored for now. However, quantum theory allows such 
degrees of freedom and experiments have verified their existence. Hence, they will be discussed separately 
in the later chapter on particle spin. 
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or 

[XP - PX]\s) = ih\s), (3.8) 

for any |s). If the eigenkets of X are |ai), then the position representation of equation 3.8 
is obtained by multiplying on the left by the corresponding eigenbra (a:|. 

(x\[XP-PX}\s) = ih(x\s). (3.9) 

Inserting the identity operator of equation 1.59 in two places in the equation we get 

I I [(x|X|a; , )(a; , |P|a: /, )(a; ,/ |s) — (x\P\x')(x'\X\x")(x"\s)]dx'dx" = ih(x\s). (3.10) 

Using the fact that |ai) are eigenstates of X and the orthonormality of continuous eigenstates 
one obtains 

j J [; x6(x — x')(x'\P\x") — (x\P\x')x"8{x' — x")\{x"\s)dx'dx" = ih(x\s). (3.11) 
Integrating over x' gives 


J (x\P\x")(x — x"){x"\s)dx" = ih(x\s). 


(3.12) 


Using the defining equation 1.63 of the Dirac delta 2 and equation 1.76, one notices that the 
above equation is satisfied by 


(x\P\x") 


6(x — x ) 

in - 

x — x" 


—ih6'(x — x"). 


(3.13) 


Now in the position representation the operation of P on any state |s) would be (inserting 
an identity operator) 

(x|P|s) = J (x\P\x')(x'\s)dx'. (3.14) 

Then using equations 3.13 and 1.74, one obtains 


A A 

(x\P\s) = —ih—(x\s) = -ih—'L s (x). 


(3.15) 


Here a partial derivative is used as the function, in general, depends on time as well. 
Hence, in the position representation the effect of operating by P on a state is equivalent 
to operating the corresponding wavefunction by the differential operator —ihd/dx. 

■^Mathematical note: To use equation 1.63 one must have (*|s) to be strictly a finite function of x. We do 
not know this for a fact even though from the probability interpretation it might seem reasonable. However, 
it is possible to extend equation 1.63 to also apply for a certain class of /(*) that are infinite at some points. 
For example, from equation 1.71 one sees that the f(x) of equation 1.63 itself could be a Dirac delta. For 
fear of getting too deep into mathematics, I shall restrict the discussion to only those wavefunctions (finite 
or infinite) that satisfy equation 1.63 when substituted for /(*). Problem 5 demonstrates how some types 
of infinite wavefunctions cannot be allowed. The mathematically oriented reader might try to solve for 
{x\P\x") from equation 3.12 for a more general class of wavefunctions. 
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It is now possible to find the position representations of the eigenstates of position and 
momentum. The position eigenstates are |x). Their position representation at the position 
x' is, by dehntion, (x’\x). As the position eigenstates must be orthonormal 

(x'\x) = 6(x — x'). (3.16) 

The eigenstates of momentum are | p) (with eigenvalue p) and their position representation 
is, by definition, ( x\p ). From the definition of eigenstates and equation 3.15 

d 

-ih—(x\p)=p(x\p). (3.17) 

The solution is 

(x\p) = Aexp(ixp/h). (3.18) 

Normalization gives the value of A. That is 

6 {p — p') = {p\p) = I (p\x)(x\p')dx 

= j (x\p)* (x\p')dx 

= A* A j exp [ix(p' —p)/h]dx 
= A*A2nh8(p — p'). 

This gives 

A = (27r^) _1//2 exp(ia), (3.19) 

where a is an arbitrary real number that could depend on time. But it has no physical 
significance due to rule 2. Hence, choosing a = 0, equation 3.18 gives 

(x\p) = (27iA) -1 / 2 exp(ixp/h). (3.20) 

3.2 The momentum representation 

Another popular representation is the momentum representation. It is analogous to the 
position representation. The momentum representation of a state |s) would be a function 
of momentum eigenvalues given by the components 

$®(p) = (P\s)- (3.21) 

The effect of operating |s) by momentum P in the momentum representation would be like 
multiplying by p. 


{p\P\s) =p(p\s). 


(3.22) 
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The effect of operating |s) by X would be 

A A 

(j}\X\s) = ih—(p\s) = ih—$ s (p). (3.23) 

The momentum representation of the eigenstates of momentum are 

(p'\p) = 6(p'-p). (3.24) 

The momentum representation of the eigenstates of position are 

(p\x) = (2irh)~ 1/2 ex.p(-ipx/h). (3.25) 


Problems 

1. Generalize equations 3.6, 3.15, 3.16 and 3.20 for three dimensions. 

2. Derive equations 3.22, 3.23, 3.24 and 3.25. 

3. Generalize the results of problem 2 for three dimensions. 

4. For any state |s), show that its momentum representation is a Fourier transform of 
its position representation. [Hint: Use equation 1.59] 

5. If the position representation (wavefunction) of a state |s) goes to infinity (monoton- 
ically) at infinity, show that its momentum representation is infinite for all p. 

6. Consider two arbitrary state vectors |r) and |s). Let their respective position rep¬ 
resentations be H’r(aj) and 4/ s (a;) and their respective momentum representations be 
4> r (p) and 4> s (£>). Show that the inner product (r|s) is given by 

(r|s) = j 'S>*'S/ s dx = I 4>*4> s dp. 



Chapter 4 


Some Simple Examples 


4.1 The Hamiltonian, conserved quantities and expectation 
value 


The observational philosophy of quantum mechanics is so different from that of classical 
mechanics that we need to discuss it in more concrete terms before considering examples. 
From the laws of quantum mechanics (postulate 4) we have learnt that predictions are only 
probabilistic. Hence, given a system in a state |s), the result of a measurement on it will 
in general be different at different times. Furthermore, as a result of the first measurement 
the state of the system might change violently as it has to transform into an eigenstate 
of the operator just measured (postulate 5). What, then, would be the use of such a 
measurement? It seems that a measurement made at any time will say very little about 
later measurements and without such predictive power a theory has little use. 

However, the situation is not that hopeless. Certain measurements can still be pre¬ 
dicted rather well by the quantum theory. For example consider a conservative system 1 
with a hamiltonian (same as energy for our purposes) operator H. The following theorem 
shows that energy measurements in such a system are predictable. 


Theorem 4.1 For a conservative system an energy eigenstate changes with time only by 
a multiplicative factor and hence, stays in the same physical state. 


Proof: Let the eigenstates, | E), of the hamiltonian, H , be labeled by E, the eigenvalues. 

1 Note: This is usually not a restrictive assumption in quantum mechanics as most quantum systems of 
interest are microscopic in nature where all forms of energy loss can be accounted for and included in the 
system to make it conservative. Hence, most of the text will deal with conservative systems and when a 
nonconservative system is to be studied, special care will be taken. 
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As the system is conservative, H has no explicit time dependence and hence, E will 
be time independent. Let the eigenstate | E) change to some state | E) t in time t. \E) t 
is not necessarily an eigenstate of H. From the Schrodinger equation (postulate 1) 

ih^\E) t = //1/•-')/• (4.1) 

The known initial condition at time t = 0 is 

\ E )o = | E) (4.2) 

From equation 4.1 we see that in an infinitesimal time dt after t = 0 the state | E) 
changes to | E)dt given by 

\E)dt = (1 — iHdt/h)\E) = (1 — iEdt/h)\E) 

as E is the energy eigenvalue of | £7). If n such increments in time are made successively 
such that n —> oo, dt —> 0 and ndt = t (finite t). then one obtains 

I E) t = lim (1 - iEdt/h) n \E), (4.3) 

which can be seen to give (see problem 1) 

| E) t = exp(-iEt/H)\E). (4.4) 

So an eigenstate of energy changes in time only by a multiplicative factor which 
means that it remains the same eigenstate and from rule 2, it is evident that there is 
no physical change. This completes the proof. 


Now if a measurement of energy yields the value E, we know from postulate 5, that the 
system collapses into the eigenstate | E). Theorem 4.1 states that once this happens there is 
no more temporal change in the state of the system (unless otherwise disturbed). If another 
measurement of energy is made on the system after some time (with no other disturbance) 
the probability of obtaining a value E’ is given by postulate 4 to be related to \(E'\E)\ 2 . 
From the orthogonality of eigenstates of H, this is seen to give zero probability of E' being 
anything other than E. This is perfect predictability and is restricted by experimental 
errors alone (like in classical mechanics). 

Such predictability of repeated energy measurements makes the hamiltonian a very 
special operator in quantum mechanics. However, for specific problems, one may find other 
observables which have the same predictability in repeated measurements. Such observables 
are called conserved quantities and are defined as follows. 

Definition 25 An observable is a CONSERVED QUANTITY if repeated measurements of 
it at different times result in the same value as long as the system is not disturbed in any 
way between measurements. 
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To identify such observables we shall use the following theorem. 

Theorem 4.2 For a conservative system with Hamiltonian H, an observable Q (with no 
explicit time dependence) is a conserved quantity if and only if [Q,H] = 0 (i.e. Q and H 
commute). 


Proof: We shall first prove that if Q is a conserved quantity [Q,H] = 0. Suppose a 
measurement of Q results in the eigenvalue q. Hence, the resulting eigenstate of Q is 
one of a set of some n 9 -fold degenerate eigenstates with eigenvalue q. This state will 
be labelled as | qi), q giving the eigenvalue and i(= 1 , 2 ,..., n q ) labelling the different 
degenerate states. At a time t after the measurement, the same state will change to 
| qi)t- | qi) can be expanded in eigenstates of H as 


\qi) = J2 a qiE\E). (4.5) 

E 

The sum over all energies E is really a shorthand for sum or integral over all energy 
eigenstates which means that degenerate eigenstates are individually included. From 
the result of problem 2 one can also see that 

\qi)t = J2 a Q iE \ E )t- (4.6) 

E 


Using equation 4.4 this gives 

\qi) t = ^a qiE exp(-iEt/h)\E). (4.7) 

E 

As Q does not depend on time explicitly, its set of eigenvalues are unchanged with 
time. So it is meaningful to say that if Q is a conserved quantity then q should be 

its only possible measured value in the time developed state \qi)t as well. In other 

words, \qi)t must be a linear combination of the n q degenerate eigenstates of Q with 
eigenvalue q. 

n q 

I qi)t = Y^u ij (t)\qj). (4.8) 

i =i 

Hence, | qi) t is an eigenstate of Q at all times and to satisfy equation 4.7 at all times 
the energy eigenstates must also be eigenstates of Q (for degenerate energy eigenstates 
some suitable linear combination may have to be chosen). Thus we conclude that for Q 
to be a conserved quantity, there must exist a complete set of simultaneous eigenstates 
of Q and H. We shall label these eigenstates by the corresponding eigenvalues of both 
operators i.e. \qE) (the labels for distinguishing degenerate states will be suppressed 
for convenience). 
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Now let us expand an arbitrary state |s) as a linear combination of this complete set 
of eigenstates. 

I s ) = J^c gB jqE). (4.9) 

qE 

Then it follows that 

QH\s) = c qE qE\qE) 

qE 

= J2 c q E hQ\qE) 

qE 

= HQ\s). 

Hence, a necessary condition for Q to be a conserved quantity is 


[Q,H]\s) = 0. 

As |s) is an arbitrary state, one may write the condition as 

[Q,H}= 0. (4.10) 


Now it will be shown that equation 4.10 is also a sufficient condition. Once again, 
let | qi), ( i = 1,2, ...,n q ) denote a set of n g -fold degenerate eigenstates of Q with 
eigenvalue q. As Q is not explicitly dependent on time, q will be independent of time. 
Then, for all i 

Q\qi) = q\qi ). (4.11) 


Operating this with H gives 

HQ\qi) = qH\qi). (4.12) 

Given that [Q,H] = 0, this leads to 


QH\qi) = qH\qi). 


(4.13) 


This means that the state H\qi) is also an eigenstate of Q with eigenvalue q. Hence, 
it must be some linear combination of the n q degenerate states i.e. 

n q 

H \Qi) = '52 c ij\<lj)- ( 4 - 14 ) 

3 =1 

The above equation can be used to show that repeated operations (meaning higher 
powers) of H on | qi) will still produce some linear combination of the n q degenerate 
states. This leads to the conclusion that the time development operator of problem 2, 
operated on | qi) will also produce a linear combination of the same n q degenerate 
states i.e. 

n q 

I qi)t = U t (t)\qi) = ^ u ij(t)\qj). 


(4.15) 
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So the time developed state | qi) t is an eigenstate of Q with eigenvalue q at all times. 
Hence, the measured value remains q at all times thus showing Q to be a conserved 
quantity. This completes the proof. 


Conserved quantities are known to be of importance in classical mechanics as they are often 
used to label specific trajectories. Correspondingly, in quantum mechanics state vectors 
are labeled by eigenvalues of conserved quantities (e.g. energy, angular momentum etc.). 
Further, there is a classical result that has the same physical significance as theorem 4.2: 


clq 

dt 


{q, H} + 


dq 

dt’ 


(4.16) 


where q represents the classical observable corresponding to the quantum operator Q. The 
last term, in the above equation, is nonzero only when q depends explicitly on time. In 
quantum mechanics explicit time dependence of observables is uncommon 2 . Hence, the 
vanishing of the commutator brackets in quantum mechanics would classically mean the 
vanishing of the Poisson brackets (postulate 2) giving q to be a classically conserved quan¬ 
tity. One can also prove the following quantum result that looks more like the classical 
relation of equation 4.16. 


Theorem 4.3 If Q is an observable and |r) and |s) are two arbitrary states then 




(■•M) ag 

in dt 


Proof: Using the product rule for derivatives and postulate 1, 


s <r|e|s > = (s (r| ) 0|s> + <r|0 (s |s) ) + (r| ^ |s> 
= (-^)»> + < r "3 

_ , , rl BQ , 

in + dt |s 

This completes the proof. 


m 

in 


j + (r 


dQ 

dt 


s) 


In quantum mechanics theorem 4.3 is not as useful as theorem 4.2 because it does not give 
the actual measured values explicitly. However, theorem 4.3 can be used to find the time 

2 For explicitly time dependent observables, dq/dt A 0- I 11 general, {q, H} depends on the properties 
of the specific system through H , but dq/dt does not. This means {q, H} cannot exactly cancel dq/dt on 
the right side of equation 4.16. So, explicitly time dependent observables cannot be classically conserved 
quantities. In quantum mechanics such nonconserved quantities have limited predictability and thus are of 
lesser importance 
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dependence of the average measured value of any operator. For this we need to define the 
average measured value in quantum mechanics which is the so called expectation value. In 
giving meaning to an average value in quantum mechanics, one has to be careful. Making 
a measurement on a state can change it so radically that making repeated measurements 
over time and then averaging (time average) has no physically useful meaning. Hence, the 
meaning of an average must be that of an ensemble average as stated below. 

Definition 26 The EXPECTATION VALUE (Q) s of an observable Q in a state |s) is 
defined as the average of Q measurements made on a large number (tending to infinity) of 
identical systems all in the same state |s) with no two measurements made on the same 
system. 


Consider a state |.s). For an observable Q, the probability of measuring its eigenvalue q, in 
this state, is known from postulate 4. Hence, using this postulate and the definition of an 
average, the expectation value measured for a large number (tending to infinity) of systems 
all in state |s) would be 


^ T, q g{s\q){g\s) 
E q (s\q)(q\s) 


(4.17) 


where | q) denotes an eigenstate with eigenvalue q and Y) q a sum over all eigenstates (de¬ 
generate states considered individually). Then using equation 1.55 twice 


(Q)s 


J2 q (s\Q\g)(g\s) 

(s\s) 


<g|Q|g) 

(s|s) 


Then theorem 4.3 will give 



([Q,H]) S /dQ\ 
ih \ dt / s 


(4.18) 


(4.19) 


This is a generalization of what is known as Ehrenfest’s theorem. It provides a means of 
comparison of classical and quantum measurements. It is seen that averages of measure¬ 
ments (expectation values) in quantum mechanics obey the classical equations of motion 
given by equation 4.16. This is in accordance with the idea that classical measurements are 
on larger scale objects and hence so inaccurate that only averages of quantum measurements 
adequately agree with them. 


4.2 Free particle in one dimension 


To understand the principles discussed in chapter 2 and to use some of the mathematical 
results obtained in chapter 3 and this chapter, we will study the simplest possible system 
viz. the one dimensional free particle. The classical case of this problem is quite trivial as 
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it would give the solution to be a constant velocity trajectory. In quantum the problem 
is not as trivial and does merit discussion. It is to be noted that for a particle to show 
quantum behavior it must be small enough e.g. an electron. 

The form of the Schrodinger equation tells us that the system is described completely 
by the hamiltonian H. From classical physics the form of the free particle hamiltonian is 
known to be 

H = (4.20) 

where P is the momentum and m the mass. In quantum, P is known to be an operator. We 
shall now try to predict the measurement of three common observables viz. momentum, 
energy and position. 


4.2.1 Momentum 

We already know P has continuous eigenvalues that can take values from minus to plus 
infinity. So if we start with some state |s) the result of a P measurement will be p with 
probability |(p|s)| 2 (postulate 4) if | p) is the eigenstate corresponding to the eigenvalue p. 
As a result of the measurement the system will collapse into the state | p). As an operator 
commutes with itself i.e. [P,P] = 0, it is easy to see that (equation 1.14) 

[P,H} = 0. (4.21) 

Hence, from theorem 4.2, P is a conserved quantity and subsequent measurement of mo¬ 
mentum on this system will continue to give the same value p as long as the system is not 
disturbed in any other way. The state of the system stays | p). 


4.2.2 Energy 

If | E) is an energy eigestate with eigenvalue E then the probability of measuring E in a 
state |s) would be |(E|s)| 2 . As we are considering only conservative systems, energy is of 
course conserved and hence every subsequent measurement of energy will produce the same 
value E as long as the system is not otherwise disturbed. Now it can be seen that | p) is 
also an eigenstate of H (see problem 3). 

H\p) = ^-\p) = ^\p)=E\p). (4.22) 

2m 2m 

Hence, the set of states | p) are the same as the set of states | E). However, we choose to 
label these simultaneous eigenstates with p and not E. This is because, in E, they are 
degenerate. Two states with opposite momenta have the same value for E{= p 2 /2m). 
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In chapter 3 we saw that the position representation of | p) i.e. its wavefunction (for 
fixed time) is 

^ p (x) = (x\p) = Aex.p(ixp/h). (4.23) 

As this is also an eigenstate of energy, from equation 4.4 we see that the time dependence 
of this wavefunction is given by 

x L p (x,t) = Aexp[i(xp — Et)/h\. (4.24) 

This function is seen to be a wave with wavelength 2irh/p and angular frequency E/h. 
Historically, it was this wave form of the position representation of the energy eigenstates 
of a free particle that inspired the name wavefunction. In early interference type experi¬ 
ments this relationship between wave properties (wavelength and frequency) and particle 
properties (momentum and energy) was discovered. 

Now one can see why the position representation has been historically preferred. Ex¬ 
periments like electron diffraction basically make position measurements on some given 
state. By making such measurements on several electrons (each a different system) in the 
same state the probability distribution of position measurements is obtained. And this 
probability distribution is directly related to the position representation of a state as given 
by equation 3.4. 


4.2.3 Position 

The position operator X is not a conserved quantity as it is seen not to commute with the 
hamiltonian. Using equation 3.7 and the properties of commutator brackets, one obtains 

[X.H] = [X.A |=iS h. (4,25) 

Hence, position measurements are meaningful only in certain types of experiments. A 
position measurement on a system can predict very little about subsequent position mea¬ 
surements on the same system even if it is not disturbed in any other way. However, in 
particle scattering type experiments, a position measurement is made only once on each 
particle. In such experiments position is measured for different particles each in the same 
state to obtain information on the probability distribution of position measurements 3 . This 
is the kind of situation we will be interested in. 

If a particle is in a state |s), its position probability distribution is |(rr|s)| 2 . In partic¬ 
ular if |s) is a momentum (or energy) eigenstate, this distribution is |(x|p)| 2 which, from 
equation 3.20, is seen to be independent of x. Hence, a particle with its momentum known 

Tor this, each particle needs to behave like a separate isolated system which means the density of 
particles must be low enough to ignore interactions amongst them. 
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exactly, is equally likely to be anywhere in space! This is a special case of the celebrated 
Heisenberg uncertainty principle (see appendix B). 


To understand the unpredictable nature of nonconserved quantities, it is instructive to 
further analyze this specific example of the position operator for a free particle. So we shall 
see what happens if repeated position measurements are made on the same free particle. 
Whatever the initial state, the first measurement results in a value, say X\ . This collapses 
the state to |rci). Due to nonconservation of position, this state starts changing with time 
right after the measurement. If the first measurement is made at time t = 0, at later times 
t. the state will be |xi) f which can be found from the Schrodinger equation: 

ih^\xi)t = H\xi ) t • (4.26) 

To observe the time development of |a;i), as given by the above equation, it is convenient 
to expand it in energy eigenstates 4 which in this case are the momentum eigenstates | p). 
So, for f = 0we write 

ki) = j (p\xi)\p)dp. (4.27) 

The time development of this state is given by (problem 2) 

I xi) t = j (p\xi)\p) t dp, (4.28) 

where \p)t is the time development of the energy eigenstate as given by equation 4.4. Then 
using equation 3.25 we get 

|xi)i = (27^!l)~ 1/,2 j exp(— ipxi/h) exp(— iEt/h)\p)dp, (4.29) 

where E = p 2 /2m. Hence, at time t the probability of measuring a value x for position is 
given by |(x|a:i)t| 2 where 


(x\xi) t = (2nh) 4 / 2 J exp(—ipxi/h)exp(—iEt/h)(x\p)dp. 
Using equation 3.20 and E = p 2 /2m this gives 


(x\xi) t = {2ttK) 1 j exp 


zi P{X 1 -X) + ^ 


h 


2m 


dp. 


This integral has meaning only in a limiting sense. If 


u(x i — x, t) = j exp 


2 M / \,P t 
ap - - p{x i - x) + — 


h 


2m 


dp, 


(4.30) 


(4.31) 


(4.32) 


4 Expanding in known eigenstates of a conserved quantity is convenient because its time dependence is 
simple. In particular, the time dependence of the energy eigenstates is already known 
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with a > 0, then 


(x\xi)t = (27 rh) 1 lim u(x± — x, t). 

a^O 


Computing the integral in equation 4.32 gives 


u(x i — x, t) = 


2Tiirnh{t + ib ) 


t 2 + b 2 


1/2 


exp 


im(x i — x) 2 t 
2 h(t 2 + b 2 ) 


exp 


mb(x i — x) 2 

2 h{t 2 + 6 2 ) 


(4.33) 


(4.34) 


where b = 2mha. Hence, the probability of finding the particle at x after time t is 

mb(x i — x) 2 


|(2?|aq) t | 2 = lim(27rft) 1 m(t 2 + b 2 ) 1 ^ 2 exp 
6—0 


h(t. 2 + b 2 ) 


(4.35) 


To understand this physically we first consider a nonzero value for b. In that case we notice 
that the probability decreases with time if 


\x\ — x\ < 


H(t 2 + b 2 ) 
2 mb 


1/2 


(4.36) 


and increases at points farther out from X\ . We shall call the right hand side in the above 
inequality the inversion point. With time, the inversion point moves outwards from x±. 
This is sometimes interpretted as probability “flowing” outwards from the initial point X\ 
somewhat like in diffusion. 


If b = 0, equation 4.35 gives unusual results. At t = 0 the probability is still zero 
everywhere other than x = x\. But even an infinitesimal time later, the probabilty becomes 
a nonzero value constant over all space and decreases with time as l/t\ This happens 
because with 6 = 0 the initial delta function wavefunction has infinite momentum (and 
hence, infinite velocity) components in finite amounts. Therefore, parts of the probability 
can go to infinity instantaneously and then get lost giving a decreasing overall probability. 
In the light of special relativity infinite velocity is not possible. This issue can be resolved 
only by introducing a relativistic quantum mechanics as will be done later. 


4.3 The harmonic oscillator 


In both classical and quantum mechanics a common practical problem is that of the behavior 
of a system around an equilibrium point. A classical example of this is a bridge. It is in an 
equilibrium state but for the small oscillations caused by traffic, strong winds or sometimes 
even an earthquake. 

Classically the standard method for analyzing this is to expand the potential energy 
in a Taylor series about the position of equilibrium in terms of all degrees of freedom [2]. 
This is not as complex as it may sound because such an expansion need not have a zeroth 
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order (i.e. a constant) term as the potential is known only upto an arbitrary constant. The 
first order term also vanishes as it has the derivative of the potential (i.e. the force) at the 
equilibrium point as a coefficient. This means that the lowest nonzero term for the potential 
is the second order term which is quadratic in displacement coordinates. Considering only 
small oscillations, one can now ignore all higher order terms. This approximation simplifies 
the problem and still gives useful solutions in many situations. 

A similar problem at the atomic level is that of atoms in a molecule that have equilib¬ 
rium distances between each other. These atoms also can oscillate about their equilibrium 
position. Once again, a small oscillations approximation leads to a quadratic form for the 
potential. However, in the atomic case classical mechanics will be inadequate and quantum 
analysis is required. The results obtained can be verified experimentally. 


As in the classical case, a suitable choice of coordinates can separate the problem into 
several one dimensional problems each with a potential energy given by [2] 

V = lkX 2 , (4.37) 

where k is a constant and X the linear displacement from equilibrium. We shall now 
proceed to solve this one dimensional problem. This system is usually called the harmonic 
oscillator. Using equation 4.37, we write the hamiltonian to be 


p2 p 2 1 

H = — + U = — + - kX 2 , 
2m 2m 2 


(4.38) 


where m is a parameter that is determined by the fact that P is the momentum conjugate 
to X. The determination of m is no different from the classical problem as the hamiltonian, 
in both classical and quantum, is of the same form. For example, for a diatomic molecule 
m = m\m 2 livri\ + m 2 ) with m± and m 2 being the masses of the two atoms. This m is 
called the reduced mass. As P is the momentum conjugate to X , postulate 2 gives 


[X, P] = ifi. (4.39) 

Now it is easy to see from theorem 4.2 that neither P nor X is conserved. The only 
conserved quantity is H. Direct position measurements, like in scattering experiments, are 
not possible as that would mean directly measuring interatomic distances in molecules. 
This makes the measurement of P or X experimentally uninteresting. Hence, we shall 
discuss the measurement of H alone. These measurements are actually made indirectly 
in molecular spectra. Unlike the free particle, this system has a discrete set of energy 
eigenvalues. These eigenvalues, being the only possible results of energy measurements, 
need to be found. 


So the problem at hand is to find all possible values of E such that 


H\E) = E\E) 


( 4 . 40 ) 
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where H is given by equation 4.38 and | E) is the corresponding eigenstate. There happens 
to be no simple recipe for solving such a problem directly. So we shall draw from vector 
algebra where, very often, problem solving is more straightforward in a suitably chosen 
coordinate system. The analog of a coordinate system, in this case, is a representation 
of state vectors. The two representations discussed in chapter 3 are both suitable for our 
purposes. We shall choose the position representation in the following only because of 
historical popularity (see problem 4). 


4.3.1 Solution in position representation 

The position representation reduces equation 4.40 to a differential equation that can be 
solved by standard methods. Using the results of chapter 3, equations 4.38 and 4.40 would 
give 

{-iLw + \ kx<1 ) ue{x)=EMx) ' (4 - 41) 

where ue(x ) = (x\E) will be called the energy eigenfunction. As ue(x) is related to the 
probability of finding the particle at position x, it is reasonable to believe that it cannot 
be infinity at infinite x that is: 

lim ue(x ) < oo. (4.42) 

|cc |—>oo 

This can also be seen from the result of problem 5 in chapter 3 which says that if the 
above condition is not satisfied, the momentum representation is meaningless and hence, 

| E) would not be a state belonging to the set V. Besides, the position representation of 
the momentum operator, used in equation 4.41, was derived in chapter 3 under the same 
condition. So, this boundary condition given by equation 4.42 will have to be imposed on 
the solutions of equation 4.41. 

It is convenient to write equation 4.41 in terms of a dimensionless independent variable 

y = ax, (4.43) 

where a choice of 

a 4 = mk/h 2 , (4.44) 

is seen to simplify equation 4.41 to the following. 

~^jr + ( e ~ V 2 ) u e = 0, (4.45) 

where 

_ 2 o?E _ 2 E /my/ 2 _ 2 E 

k h v k ) hu> ’ 

and oj is the classical angular frequency of natural oscillation. 


(4.46) 



CHAPTER 4. SOME SIMPLE EXAMPLES 


37 


To solve equation 4.45 we first obtain a solution at large \y\ to make it easier to impose 
the boundary condition of equation 4.42. For large \y\, e <C y 2 and hence, equation 4.45 
would give 



(4.47) 


where v is the large \y\ limit of u E - For large \y\, an approximate solution of v is seen to be 


v « exp(±y 2 / 2 ). 


(4.48) 


The positive exponent is not possible due to the boundary condition in equation 4.42. 

Hence, using the negative exponent solution in equation 4.48, we could write a solution of 

ue to be 

ue = H E (y ) exp(-y 2 /2). (4.49) 

Replacing this in equation 4.45 would give the equation for He(]j) to be 

+ (4.50) 

The solution of equation 4.50 can be obtained by the standard series method i.e. He is 
written as a power series in y. 


OO 

H E {y) = y s a iV with °o 0. 

i =0 


(4.51) 


Replacing this in equation 4.50 would give the following conditions on the coefficients a*. 


s(s — l)ao 
s(s + l)ai 

(i + s + 2)(i + s + l)oj _|_2 — (2i + 2s + 1 — e)oj 


As ao 7 ^ 0, from the first equation above, we see that 


0, (4.52) 

0, (4.53) 

0, i = 0,1,2,.... (4.54) 


s = 0 or 1. (4.55) 

We must now analyze the behavior of H E for large \y\ to make sure the boundary condition 
of equation 4.42 is satisfied. The large |y| behavior, obviously, is governed by the large 
powers of y in the series. So, from the above we notice that for large values of i, 

ai + 2 ~ 2cii/i. (4.56) 


A Taylor series expansion of the function exp(y 2 ) shows that its coefficients also satisfy the 
above relation for large powers. Hence, for large \y\, H E behaves like exp(y 2 ). But, from 
equation 4.49, we see this to violate the boundary condition of equation 4.42. However, if 
the series were to terminate beyond a certain power of y. equation 4.56 would be irrelevant 
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and the boundary condition would be satisfied due to the exponential in equation 4.49. 
From equation 4.54 we notice that if 


e = 2j + 2s + l, (4.57) 

for some integer j, then a,j + 2 vanishes and all subsequent coefficients related to it (like aj + 4 , 
a,j + 6 etc.) also vanish. This implies that if j is odd, all o’s with odd indices higher than j 
vanish and if j is even, all a’s with even indices higher than j vanish. Hence, if j is odd the 
series will still have large powers from the even indexed terms that will not vanish as they 
are all related to ao (from equation 4.54) and ao is chosen to be nonzero in equation 4.51. 
So, for the complete termination of the series, j must be even to ensure the termination 
of the even indexed terms and a± must be chosen to be zero to ensure the termination of 
the odd indexed terms. This shows that He must be an even polynomial if s = 0 and an 
odd polynomial if s = 1. Now, from equations 4.46 and 4.57, we find that He is a suitable 
solution only if 

E = hu(j + s + 1 / 2 ). (4.58) 

If n = j + s, then n is odd if s = 1 and it is even if s = 0. This is because j is always 
even. Hence, He is an odd or even polynomial depending on whether n is odd or even. 
The corresponding energy eigenvalues are 

E = (n+ l/2)hu, n = 0,1,2,... (4.59) 


This is seen to be a discrete set. It is this kind of discontinuity of possible observed values 
that attracted attention in early investigations and inspired the name “quantum”. It is to 
be noted that zero energy is not possible. The lowest, or the so called ground state, energy is 
hio/2. However, this energy is in no way measurable. In actual spectroscopic measurements 
only the differences in energy of pairs of states are measured when the system jumps from 
a higher to a lower energy state releasing a photon (particle or quantum of electromagnetic 
radiation) carrying the energy difference. 


The solutions for He are very often labelled by the integer n of equation 4.59 rather 
than by E i.e. He = H n . The H n are called the Hermite polynomials in mathematics. 
Properties of the H n are to be found in standard texts [3]. The property which is most 
useful comes from the orthonormality of eigenstates i.e. if we label the states | E) also by 
the integer n and call them | n) and correspondingly, ue is called u n then 


or 


8 n i = (n\l) = j (n\x)(x\l)dx = j u^uidx 
Srd = 0 C 1 I H*Hi exp(~y 2 )dy. 


(4.60) 


Here we have used equations 4.43 and 4.49. The orthonormality condition is used to find 
the ao coefficient for each H n . 
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4.3.2 A representation free solution 

The previous section illustrated a “brute force” method of solving a problem. Such a 
method is very useful when one has no guidelines for approaching a problem. However, 
sometimes (and only sometimes) long periods of reflection can reveal more elegant solutions. 
This happens to be true for the harmonic oscillator case. In the following, I shall present 
this elegant, representation free, solution. 


For convenience, let us first define the following dimensionless quantities proportional 
to position and momentum respectively. 



Q = (muj/hy^X 

(4.61) 


K = (mlvw)~ l / 2 P 

(4.62) 

From equation 4.39 one sees that 


[Q,K] = i, 

(4.63) 

and from equation 4.38 

H -- 

= hco(K 2 + Q 2 )/2 = hcuG, 

(4.64) 

where 


G = (K 2 + Q 2 )/ 2, 

(4.65) 


is a dimensionless operator proportional to H. As G is the sum of two squared hermitian 
operators, one might expect that all its eigenvalues g are positive. This of course needs to 
be proved. The following theorem will lead to the result. 


Theorem 4.4 For any arbitrary state |s), the following is true. 

(s|G|s) > 0 


Proof: Let 

Its hermitian adjoint is 


= 2 ~ l / 2 (K-iQ). 


= 2 ~V 2 (K + iQ). 

Then, using equations 4.63 and 4.65 

a)a = G + i[Q, K\/2 = G — 1/2. 

Now, if |r) = o|s), then from rule 5 of state vectors 

q < ( r | r j _ ( s | a t a | s ) = (s|G|s) — (s|s)/2 

Hence, 


(4.66) 

(4.67) 

(4.68) 


This completes the proof. 


(sjGjs) > 0 
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In the above theorem, if |s) were replaced by one of the eigenstates of G , it would follow 
that the corresponding eigenvalue 

g> o. 


The eigenstates | n) of the operator 


N = a) a, 


(4.69) 


are the same as those of G , as G and N differ by a constant number. They are labelled by 
the eigenvalues n of N. Also, theorem 4.4 can be seen to show that 


n > 0. (4.70) 

From equation 4.63, the operators a and al can be seen to have the following commutator 

[a, a)] = 1. (4-71) 

Now consider the state a\n) where | n) is the energy eigenstate corresponding to the eigen¬ 
value n of N. Using equation 4.71 we get 

Na\n) = Gdaa|n) = (aa' — l)a|n) 

= (aN — a)|n) = (an — o)| n) = (n — l)a|n). 


This shows that a\n) is also an eigenstate of N with eigenvalue n — 1. Consequently, a is 
called the lowering operator as it lowers an eigenstate to another with eigenvalue less by 
one that is 

a|n) = c n |n — 1), (4.72) 

where c n is a complex number. Similarly, a) can be seen to be the raising operator. 

Na^\n) = a)aa)\n) = a^(a^a + l)|n) 

= a^(n + l)|n) = (n + l)a)\n). 


Hence, 


a)\n) = d n \n + 1), 


(4.73) 


where d n is a complex number. Now, due to equations 4.70 and 4.72, if any eigenstate is 
repeatedly operated on by the operator a, then after a finite number of steps one obtains 
the state with the lowest n (say no), no must be less than 1 as otherwise it would be 
possible to lower it further. Hence, from equation 4.70 


0 < n 0 < 1. (4.74) 

As |r&o) is the lowest eigenstate, further lowering by a should lead to zero. 

a|n 0 ) = 0. (4.75) 


Hence, using the definition of eigenstates, 

noK) = N \ n o) — a)a\no) = 0. 


(4.76) 
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Consequently, no = 0 and as all higher values of n differ by positive integer values, they must 
all be positive integers. This gives the complete set of eigenstates to be | n) (n = 0,1, 2,...) 
(see problem 6). Using equations 4.68 and 4.69, one can find the eigenvalues of G. 

G\n) = (N + l/2)|n) = (n + l/2)|n), (4.77) 

and hence, from equation 4.64, the energy eigenvalues are found to be 

E = (n + l/2)huj, (4.78) 

which is the same result as found by the earlier method. 

It is to be noticed that here the eigenstates are not given in any functional form as 
they have no direct observational consequence. However, for future use the constants c n 
and d n need to be found. Finding the norm of both sides in equation 4.72 we get 

|c n | 2 (n — l|n — 1) = (n\c^a\n) = (n\N\n) = n(n|n). (4.79) 

As all eigenstates are normalized, this would mean c n = \Jn and 

a\n) = \/n\n — 1). (4.80) 

Similarly, from equations 4.73 and 4.71 

\d n \ 2 (n + l\n + 1) = (n|aeJ|n) = (n|(a^a + l)|n) 

= (n\(N + l)|n) = (n + l)(n|n). (4.81) 

This gives d n = sjn + 1 and 

a)\n) = \Jn + l|n + 1) (4.82) 

It is also possible to relate the states | n) to their position representations by using equa¬ 
tion 4.80. As u n = (x\n), 

(x|a|n) = \fn(x\n — 1) = \/nu n _i. (4.83) 

Then, using equations 4.61, 4.62 and 4.66 

(a:|(2ma;^) _1 / 2 [P — imuX]\n) = \fnu n -\. (4.84) 


Using the position representations of X and P, this gives 


/ h \ l t 2 du n ( muA 1 / 2 

\2muj) ~d^ + {^h) XUn 


i\fnu n _\. 


(4.85) 


This equation can be used to find all the u n as we know that U -1 = 0. Here the boundary 
condition of equation 4.42 is seen to be automatically satisfied. 
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4.4 Landau levels 


An electron in a uniform magnetic field is another simple system which is of considerable 
practical interest. In particular, in some solid state materials the electron is restricted to a 
surface and the magnetic field is applied perpendicular to it. In such a system the quantum 
nature is reflected in Hall effect measurements at low temperatures and high magnetic 
fields. The electron in this two dimensional system has discrete energy eigenvalues just like 
those of the harmonic oscillator. These energies are known as the Landau levels. 

To find the Landau levels we first write down the hamiltonian of the system in SI 

units. 

H = ^~{ P + eA) 2 , (4.86) 

2m 

where P is the two dimensional momentum, m the electron mass, e the magnitude of the 
electron charge and A the vector potential due to the uniform magnetic field. We shall 
choose the x-y plane to be the surface to which the electron is restricted. So the uniform 
magnetic field will be in the z direction. In a simple gauge choice the vector potential for 
this field would be given by 


A x = 0, Ay = BX, A z = 0. (4.87) 

where B is the magnitude of the magnetic induction and X is the x component of the 
position operator. Hence, equation 4.86 gives 

H =^[ p2 x + ( p y + eBX ) 2 ]. (4.88) 

At this stage we take a second look at the representation free solution of the harmonic 
oscillator problem. It can be seen that the problem is completely specified by the hamil- 
tonian in equation 4.64 and the commutator in equation 4.63. If the operators K and Q 
were replaced by any two operators with the same commutator, the solutions would remain 
the same. From inspection it can be seen that the present problem can be transformed to 
look exactly like equations 4.63 and 4.64 and hence, the solutions would be the same. The 


necessary transformation to dimensionless variables is 

Q x = {eBh)~ 1/2 P y + (eB/h) 1/2 X, (4.89) 

K x = (eBhy^Px, (4.90) 

Q y = (eBhy^Px + ieB/h) 1 ^, (4.91) 

K y = ( eBh)~ 1/2 P y , (4.92) 

where Y is the y component of the position operator. So 

[Qx, K x \ = [Q y , K y \ = i, (4.93) 
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and all other possible commutators vanish. The hamiltonian would then be 


H = 


eBh 
2 m 


[Kl + Ql\. 


(4.94) 


By putting 


K = K x , Q = Q x , oo = eBjm 


(4.95) 


in equations 4.63 and 4.64 we see them to be the same as given in equations 4.93 and 4.94. 
Hence, the corresponding energy eigenvalues (i.e. the Landau levels) must be 


E = (n + l/2)heB/m, n = 0,1,2,.... 


(4.96) 


Here eB/m is seen to be the classical cyclotron frequency. These energy levels have been 
indirectly observed in quantum Hall effect measurements [4]. 


Problems 


1. 

2 . 


Derive equation 4.4 from equation 4.3. 

Show that the time development operator is 

U t {t) = exp(—iHt/h) 

i.e. a state |s) at zero time develops to |s)t at time t where 

|s)t = U t (t)\s). 

Hint: An analytical function of an operator is defined by its Taylor series i.e. 

H n d n f(x ) 


m = E 


71=0 


n\ 


dx r 


I x=0 


3. Prove that two observables Q and P can have simultaneous eigenstates if and only if 
[Q. P] = 0. [Hint: Part of the proof can be found in the proof of theorem 4.2.] 

4. Find the energy eigenstates and eigenvalues for the harmonic oscillator in the mo¬ 
mentum representation. 

5. Find the Hermite polynomials, H n , for n = 0,1, 2, and 3. For the same values of n, 
show by direct integration that the u n are mutually orthogonal. Find ao in each of 
the four cases by normalizing according to equation 4.60. 

6. Show that noninteger eigenvalues are not possible for the number operator N. [Hint: 
Assume such an eigenstate to exist and see what happens on repeatedly lowering it 
by using a.] 
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7. For n = 0,1, 2, and 3, find u n using equation 4.85. 

8. For the Landau level problem find the raising and lowering operators in terms of 
momentum and position operators. Then find the equivalent of equation 4.85 in this 
case and solve for wavefunctions of the four lowest levels. 



Chapter 5 


More One Dimensional Examples 


The examples of quantum systems presented in chapter 4 gave some hint as to what to 
expect in quantum mechanics. The next step will be to understand general properties of 
solutions of some physically interesting classes of problems. Then we shall go through some 
oversimplified examples to illustrate these general properties. In this chapter we shall work 
with only a single particle in one space dimension. Extension to three dimensions will 
be later seen to be quite straightforward (chapter 8). However, extension to multiparticle 
systems, in general, is more tricky and will not be discussed in this text. 


5.1 General characteristics of solutions 


In chapter 4 we saw that energy as an observable has some special importance. In particular, 
being conserved in closed systems, it can be used to describe the states of such systems. 
Hence, we would want to identify the energy eigenstates and the corresponding eigenvalues 
of any conservative system. The relevant equation would be 


H\E) = E\E), 


(5.1) 


where H is the hamiltonian, | E) an eigenstate of H and E the corresponding eigenvalue. 
Equation 5.1 is often referred to as the time independent Schrodinger equation due to its 
similarity to equation 2.1 (the time dependent Schrodinger equation). The total energy 
operator i.e. the hamiltonian of a single particle is derived from its classical form to be 


P 2 

H =tt- + V(X), 


(5.2) 


where P and X are the momentum and position operators respectively and m is the mass 
of the particle. V, the potential energy, is assumed to be a function of position alone. 
However, a more general form of V is not difficult to handle. 
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It is to be noted that the kinetic energy term (P 2 /2m) in H is the same for all single 
particle one dimensional problems and hence, it is V, the potential energy, that characterizes 
a specific problem. In most practical problems, V is found to be a function of position. 
This makes the position representation a natural choice 1 . Hence, we write equation 5.1 in 
the position representation as follows. 


Hue{x ) = Eue(x), 


(5.3) 


where ue(x ) = (x\E) is the position representation of | E) and the position representation 
of H can be seen to be 


H = - 


n 2 d 2 


+ V(x) 


(5.4) 


2m dx 2 

As in equations 5.3 and 5.4, in the future we shall use the same symbol for both an operator 
and its representation. This does not cause any confusion as long as the representations of 
the state vectors are suitably labelled (as in equation 5.3). 


Equation 5.3 is seen to be a differential equation and should not, in general, be difficult 
to solve if V(x) is given to be some physically plausible potential energy function. But before 
solving specific problems we shall study the general characteristics of solutions for certain 
classes of V(x). 


5.1.1 E < V(x) for all x 

In classical physics we know that E < V has no meaning as it would give a negative 
kinetic energy and hence, an imaginary velocity. However, in quantum mechanics we have 
already seen nonzero values of the wavefunction ue(x) in regions of space where E < V (the 
harmonic oscillator as discussed in chapter 4). This would result in nonzero probabilities 
of finding the particle in such a region. Such an unexpected result prompts us to be more 
cautious in dealing with quantum mechanics. Consequently, we shall first consider the 
extreme case of E < V for all x. Fig. 5.1a shows such a potential energy and fig. 5.1b 
shows the possible wavefunctions. Consider an arbitrary point xq. As the overall sign of 
u E (x) can be chosen arbitrarily, we can choose ue(x o) to be positive. From equations 5.3 
and 5.4 we get 

d 2 2ni 

■^2 u E {x) = -^-[V(x) - E]ue(x). (5.5) 

As the second derivative is related to the curvature, we notice from equation 5.5 that at 
xq the curvature is positive. Consequently, it could not decrease in both directions. In a 
direction that it increases it will stay positive and hence, from equation 5.5, continue to 
have a positive curvature. This would make it increase to infinity. But such a solution is 
not allowed (problem 5 of chapter 3). So the conclusion is that E < V for all x is not 
possible for any system. 

Hater we shall see that the description of internal degrees of freedom of a particle (e.g. spin) needs a 
different representation. 
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Figure 5.1: Potential energy and wavefunction for E < V(x) case. 

5.1.2 Bound states 

This is the case where E > V(x) only in some regions of finite x and E < V(x) for |a:| —> oo. 
States satisfying these conditions are called bound states because the corresponding classical 
case is that of a particle restricted to a certain region of space. The harmonic oscillator is 
an example where these conditions are satisfied for all finite values of E. Fig. 5.2a shows 
a somewhat arbitrary example of such a potential. Given the nature of the potential, one 
can locate two points, X\ and X2, such that E < V{x) for x < x± and x > X2- 

We shall now consider the behavior of ue(x) for this potential (fig. 5.2b). As in the 
last section one can choose the function to be positive at X 2 - Then from equation 5.5 one 
concludes that for all x > X 2 the curvature is positive if the function remains positive. 
This allows three possibilities for the value of the function at infinity. First, it could curve 
upwards and go to infinity. Second, it could continue to curve upwards but have a negative 
slope all the way upto infinity without changing sign. This would require the function to 
go to zero at infinity. Third, the function could change sign even while it curves upwards. 
But once it changes sign equation 5.5 would require it to have a negative curvature. This 
would force it to go to negative infinity. Only the second possibility is allowed due to 
reasons given earlier. Equation 5.5, being a second order differential equation, allows two 
initial conditions. One can choose these conditions to be the values of the function and its 
derivative at X2 (ue(x2) and u' E (x 2 )). The choice can be made to ensure that the second 
of the above three possibilities is true. 

Next we consider the behavior of the function to the left of X 2 - In the regions where 
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(b) 


Figure 5.2: Potential energy and wavefunction for bound state case. 


E > V, using equation 5.5 once again, one finds the curvature to be negative (as long as the 
function is positive). However, if it is not sufficiently negative, it might still go to infinity 
at negative infinity as the curvature would be positive again for x < X\. The values of E 
that will yield such a result are not allowed. If E is increased to some critical value (say 
Eq) the curvature of the function would be sufficiently negative between x\ and to make 
it go to zero at negative infinity. However, if E is increased beyond Eq the curvature is so 
highly negative that it makes the function drop below zero. Once, the function is less than 
zero equation 5.5 would give it a negative curvature for x < x± which would make it go to 
negative infinity at negative infinity. This is not allowed. This shows that Eq is an allowed 
energy but energies in its immediate nieghborhood are not. Eq is seen to be the lowest 
allowed energy and hence, it is called the ground state energy and the corresponding state 
is called the ground state. If one continues to increase E beyond Eq, the function will drop 
below zero between x\ and X 2 and hence produce a positive curvature between these points. 
If the curvature is sufficiently positive it can pull up the function from negative infinity to 
zero for x —> —oo. The critical energy E± at which this happens is the next allowed energy 
(the first excited state). Energies immediately beyond E± will once again be disallowed as 
they would send the function to positive infinity at x = —oo. A repetition of this argument 
shows that the allowed energies are a discrete sequence of energies (Eq, E\, E 2 , ...). The 
subscript of the allowed energy can be seen to correspond to the number of times the 
corresponding wavefunction (eigenfunction) changes sign. This discrete nature of the set of 
eigenvalues (which are the observed values) is unique to the quantum mechanics of bound 
states and has no parallel in classical mechanics. 

Experimental observation of these eigenvalues is a little indirect. A general approach 
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Figure 5.3: Potential energy and wavefunction for scattering state case. 


is to give an ensemble of the system under study (e.g. a large collection of identical atoms), 
a random amount of energy (maybe heat). This would make different members of the 
ensemble (atoms) rise to different energy levels (eigenvalues). Then, by mechanisms that 
will be discussed later, each member “jumps” to lower energy levels by emitting the balance 
of energy in some form (usually electromagnetic particles called photons). Hence, the 
emitted energy gives the difference between initial and final energies. Different members of 
the ensemble will have different initial and final energies and would emit different amounts 
of energy. An analysis of these emitted energies (photons) can then verify the eigenvalues 
of the system. Specific examples will be discussed later. 


5.1.3 Scattering states 

This is the case where E > V(x) at x = —oo, or x = +oo or both. In addition, one 
may use the fact that for most physical cases E and V are known to be finite at infinity. 
The corresponding states are called scattering states as classical scattering problems have 
E > V at large distances from the scatterer. For bound states one noticed that the function 
had to go to zero at large distances in both directions to prevent it from going to infinity. 
In the present case we shall see that at least in the direction that has E > V at infinity, no 
such condition is necessary. 

Consider, for example, that E > V for all x > X 2 (fig. 5.3a). Then, from equation 5.5, 
one concludes that for positive values of the function the curvature is negative and vice 
versa. This forces the function to curve down when it is positive and curve up when it is 
negative (fig. 5.3b). As a result, the function becomes oscillatory and does not go to infinity 
for any value of E as long as E > V at infinity. Hence, there are no disallowed states as 
long as E > V at infinity. If E > V for all x < aq, the solution becomes oscillatory in 
that region too. It can be seen that there are no disallowed energies even if the solution is 
oscillatory only at one of the infinities. This is because the function can be held at finite 
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Figure 5.4: Potential energy and wavefunction for E < V(x) at x = +oo and E > V{x) at 
x = —oo. 

values in one direction simply by an appropriate choice of initial conditions (fig. 5.4). 

The scattering states are often said to have a “continuous spectrum” (see definition 17 
on page 8) of energies and the bound states are said to have a “discontinuous spectrum” or 
a “discrete spectrum” (see definition 16 on page 8) of energies. These terms are due to their 
relation to the electromagnetic emission or absorption spectra of materials (chapter 8). A 
system can often have both bound and scattering states in different ranges of E (e.g. the 
hydrogen atom). 

For scattering states theoretical prediction of possible energies is trivialized by the 
continuous nature of the spectrum. However, computations of probabilities of scattering of 
a particle in different directions is meaningful and nontrivial. For bound states theoretical 
prediction of possible energies is nontrivial due to the discrete nature of the spectrum but 
scattering probabilities have no meaning as no particle can escape to infinity (i.e. scatter). 

Consequently, for scattering states we need to study the meaning of scattering exper¬ 
iments as pertaining to quantum mechanics. A standard experimental situation is that of 
a beam of particles impinging on a target and then scattering in all directions (fig. 5.5). 
Although at present we are discussing only one dimensional problems, for future use, the 
following general analysis of such scattering processes will be in three dimensions. 

It is intuitive to conclude that the information on scattering probabilities in different 
directions must be contained in the wavefunction as it gives the probability of finding the 
particle at some given position. However, it must be noted that the wavefunction, as 
discussed till now, describes a single particle and not a whole beam of particles. Hence, 
it can provide information about the scattering of the beam only if all particles of the 
beam are in the same state and the beam is not dense enough to require the consideration 
of interparticle forces. All forces (or potential energies) are due to the target. Under 
these conditions we need to find a relation between the single particle wavefunction and 
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incident beam 



Figure 5.5: A particle scattering experiment. 


the measured scattetering probabilities for a beam of particles. The measure of scattering 
probabilities is called the scattering cross section and it is defined as follows. 


Definition 27 The (DIFFERENTIAL) SCATTERING CROSS SECTION is defined as 


a = 


1 dn p 
N p dw 


(5.6) 


where dn p is the number of particles scattered per unit time into the infinitesimal solid angle 
duj (measured with the target as center) and N p is the number of incident particles per unit 
time per unit cross sectional area of the incident beam. 


a would of course be a function of the direction (often conveniently given by the 3- 
dimensional polar angles ( 6,(f> )) in which the measurement is made. dn p would have to 
be measured with some particle detector placed in that direction for a known N p . The 
target is usually small enough compared to its distance from the detector to be considered 
a single point. Let S p denote the particle current density at any point in space. Then for 
the incident beam 

\S P \ = N p , (5.7) 

and for scattered particles (in 3-dinrensions) near the detector 

dn p = \S p \r 2 du>, (5.8) 

where r is the distance from the target to the detector and hence r 2 dca must be the surface 
area of the particle detector. As particle number is conserved, the particle current density 
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must obey the following conservation equation (see problem 1). 


V-S p + ^ = 0, (5.9) 

where p p is the particle density. We can also show that a similar relation holds for the 
single particle probabilty density p = where T is the single particle wavefunction (i.e. 
the position representation of the state vector). 


dp 

dt 


d d'L 

— (\&*\b) = u»*__ + 

dV 1 dt 


d\S>* 

~dP 


(5.10) 


The position representation of equation 2.1 is 



H 2 9 

- —V 2 T + ET 
2m 


(5.11) 


where the three dimensional hamiltonian H has been replaced by its position representation 


P 2 

H= 2^ + V = 

The complex conjugate of equation 5.11 is 


n 2 

2m 


V 2 + V. 


-ih 


<9T* h 2 „ 

- =-V 2 T* + v^* 

dt 2m 


(5.12) 


(5.13) 


Now multiplying equation 5.11 by T* and equation 5.13 by T and then subtracting the 
two, one obtains from equation 5.10 

^ —(TV 2 T* - 'TV 2 T) 

ut 2 tyI/ 

jj 7 

= V • (^V^* - (5.14) 


Hence, 

V-5+^ = 0, (5.15) 

where 

%Ti 

s = — (fVf* - rvf). (5.16) 

Equation 5.15 can be seen to look exactly like equation 5.9. Moreover, under the present 
assumption of all particles of the beam being in the same state and there being no interaction 
among them, it can be seen that the particle density of the beam must be proportional to 
the single particle probability density i.e. 


Pp — &P 


( 5 . 17 ) 
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for some constant a. Hence, from equations 5.9 and 5.15 it can be concluded that 


Sp — aS , 

(5.18) 

and from equations 5.7 and 5.8 this would give 


N p = q:N. cln p = adn , 

(5.19) 

where 


N = \S |, dn = \S\i' 2 du>, 

(5.20) 


near the source and the detector respectively. Now from equations 5.6 and 5.19, we can 
rewrite a in terms of quantities depending only on the single particle wavefunction. 


a = 


1 dn 
N dui 


(5.21) 


For the one dimensional case, there are only two directions of scattering (forward and 
backward), dn/dto in the forward direction would simply be the forward scattered (or 
transmitted) current n t and in the backward direction it would be the reflected current n r . 
N, the incident current density, will be replaced by the incident current, because in one 
dimension the cross sectional area of the beam has no meaning. Hence, a would have the 
meaning of a transmission and a reflection coefficient (T and R) given by 


T = nt/N, R = n r /N. 


(5.22) 


5.2 Some oversimplified examples 


Before working out some standard examples, I shall state and prove two useful theorems. 
Theorem 5.1 will not be directly useful in working out problems, but it will explain why 
most energy eigenfunctions are seen to be real. 


Theorem 5.1 Any eigenfunction of energy (i.e. a solution ue{x) of equation 5.3) can be 
written as a linear combination of real eigenfunctions of energy. 


Proof: Consider an eigenfunction i.e. a solution f(x) of equation 5.3 that is 

Hf{x) = Ef{x). (5.23) 

As the differential operator H and the eigenvalue E are both real, the complex con¬ 
jugate of equation 5.23 would be 


Hf*(x) = Ef*(x). 


( 5 . 24 ) 
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Hence, f*(x) is also an eigenfunction. Consequently, the two real functions 

'M x ) = fi x ) + f*( x ), 'M x ) = i[f( x ) ~ f*i x )], (5-25) 

are also eigenfunctions. Now f(x) can be written as a linear combination of these 
real eigenfunctions: 

fix) = [«i(x) - iu 2 (x)\/ 2. (5.26) 

This proves the theorem. 

Theorem 5.2 An energy eigenfunction and its first derivative are continuous if the poten¬ 
tial energy V is finite. 

Proof: Let us integrate equation 5.5 as follows (for positive e). 

rx+e rftii(r) ‘1m r x+t 

/ — J~T^ dx = - T / \ E ~ V(x)}u(x)dx, (5.27) 

J x—t dx fl Jx—t 

where the subscript E of the eigenfunction is omitted. E and V are given to be 
finite and u can be shown to be finite (see problem 2). Hence, the right hand side of 
equation 5.27 goes to zero in the limit e —> 0. This gives 


This proves the continuity of the derivative of u. Next consider the double integral 
of equation 5.5 with the following limits. 

rx"+e I" ex' rfi v ( T ) 1 O m rx"+e \ ex' 

/ / —r^dx dx' = -=9 / / [E-V(x)]u(x)dx dx'. (5.29) 

■Jx"-e [JO dx z J fi J x "-e [ JO 

Performing the x integral on the left side gives 

ex"+E 1 O m ex"+e I" ex' 

/ —— - K dx'= - 7T- / / [E — V{x)]u{x)dx dx', (5.30) 

Jx"-e L dx' J h Jx"-E [-'0 

where K is the constant value of the derivative of u at x = 0. Once again, as E, V 
and u are finite, the right hand side of equation 5.30 has a limit of zero when e —> 0. 
Similarly, K (on the left hand side) can be seen to be finite and hence its integral 
goes to zero in the limit of e —> 0. This leaves us with the following. 

lim[rt(o: // + e) — u(x" — e)] = 0. (5.31) 

Hence, u is found to be continuous. This completes the proof. 

Now we are ready to solve some illustrative examples. The potentials used in these examples 
will be oversimplified to bring out the qualitative aspects with very little mathematical 
manipulations. 


(5.28) 


,. du 
iim — 

e—>o dx 


-t =0. 
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(5.36) 
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A general solution of this equation is 


■u = Cexp(— Kx) + Dexp(Kx), 

(5.37) 

where C and D are arbitrary constants and 


K = +[2m(V 0 -E)/h 2 ] 1 / 2 . 

(5.38) 

As u cannot be allowed to go to infinity for |a:| oo, 


u = C'exp(— Kx), for x > a, 

(5.39) 

and 

u = Dexp(Kx), for x < —a. 

(5.40) 


To determine the four unknown constants A, B , C, and D of equations 5.34, 5.39, and 5.40 


we use the continuity conditions of u and du/dx 
and x = — a. This gives 

(theorem 5.2) at the two boundaries 

x = a 

Asin(fca) + B cos(ka) 

= C'exp(— Ka), 

(5.41) 

Akcos(ka) — Bks'm(ka) 

= —CKexp(—Kci), 

(5.42) 

—Asin(fca) + B cos(ka) 

= D exp(— Ka), 

(5.43) 

Akcos(ka) + Bksin(ka) 

= DK exp (—K a ), 

(5.44) 


These are four homogeneous linear algebraic equations for the four constants A, B, C, and 
D. Hence, a nonzero solution will exist only if the following determinant vanishes. 

sin (ka) cos (ka) — exp (—Ka) 0 

kcos(ka) —ksin(ka) Kexp(-Ka) 0 

— sin(A;a) cos (ka) 0 —exp (—Ka) 

kcos(ka) k sin (A: a) 0 — A'exp(— Ka) 

A direct solution of equation 5.45 is a little tedious. However, a slight manipulation of 
equations 5.41 through 5.44 can produce the same results more readily i.e. 


2 Asin(fca' 

) = (C — D) exp(— Ka), 

(5.46) 

2Akcos(ka) 

) = —(C — D)Kexp(—Ka), 

(5.47) 

2 B cos (ka] 

) = (C + D) exp(— Ka), 

(5.48) 

2Bk sin(A:a’ 

) = (C + D)K exp(— Ka). 

(5.49) 

One kind of solution of these four equations is given by 

A = 0, C = D, B/C = exp(— Ka)/cos(ka), 

(5.50) 

and 

k tan (A: a) = K. 

(5.51) 
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The other kind of solution is given by 

B = 0, C = — D, A/C = exp(— Kd)j sin(fca), (5.52) 

and 

/.' cot (/.'«) = —A'. (5.53) 

It is easily seen that the first kind of solution is symmetric in x and the second antisymmet¬ 
ric. The conditions in equations 5.51 and 5.53 can be seen to be solutions of equation 5.45. 
These conditions are responsible for the discreteness of the set of eigenvalues as discussed 
in subsection 5.1.2. Solving these equations to find the allowed values of E is not possible 
in a closed form. However, numerical solutions can be readily obtained. To this end it is 


convenient to write the equations in terms of the dimensionless parameter £ 
equations 5.51 and 5.53 would become (using equations 5.35 and 5.38) 

= kci. Then 

£ tan £ = (y 2 -£ 2 ) 1/2 , 

(5.54) 

and 


£ cot £ = -( 7 2 -£ 2 ) 1/2 , 

(5.55) 

where 


7 = (2mVoa 2 /H 2 ) 1 / 2 . 

(5.56) 

Once a solution for £ is found, equation 5.35 will give 


E = h \ 

2ma z 

(5.57) 


Consider equation 5.54 first. The tan(£) function being periodic there is a possibility of 
multiple solutions. By definition £ is positive. In each interval of £ given by (ro+l/2)7r < £ < 
(n + l) 7 r (n a non-negative integer) there are no solutions as tan(£) is negative and the right 
hand side of equation 5.54 is positive. In each interval of £ given by nir < £ < (n + 1/2)7 t 
( n a non-negative integer) there can be at most one solution as in these intervals the left 
hand side increases and the right hand side decreases (fig. 5.7). This fact can be used to 
numerically approximate the solution by the two point bisection method. 

The bisection method involves choosing an interval in which one and only one solution 
exists. Let [£^,£ t ] be such an interval and let £, the solution, belong to this interval. In 
the present case such an interval would be [nir, (n + l/2)ir\ where 7 > nir and n is a non¬ 
negative integer. The midpoint £1 = (£& + £*)/2 is chosen as the first trial solution. Using 
equation 5.54, if one finds that £1 < £ then £1 is taken to be the new lower bound £*, and 
if £1 > £ then it is taken to be the new upper bound £ t . This process shrinks the size of 
the interval while ensuring that the solution is still within it. Repeating the process can 
reduce the size of the interval to that of tolerable error and then £& (or £ t ) could be accepted 
as the solution £. A listing of a computer program implementing this process is given in 
appendix A. 



CHAPTER 5. MORE ONE DIMENSIONAL EXAMPLES 


58 



Figure 5.7: Rectangular potential well - graphical solution. 


Solutions of equation 5.55 can be abtained in a similar fashion. Once the energy 
eigenvalues are known, equations 5.50 and 5.52 would give the corresponding eigenfunctions 
to be in agreement with the qualitative discussions of subsection 5.1.2 (see problem 3). 

The solutions of equations 5.54 and 5.55 become much simpler in the limit of Vo —> 0. 
The corresponding eigenfunctions are also simpler (see problem 4). 


5.2.2 Rectangular potential barrier (scattering states) 

The one dimensional rectangular potential barrier is given by the following potential (fig. 5.8). 


V(x) 


Vo for 0 < x < a 
0 otherwise 


(5.58) 


From the discussion in subsection 5.1.2 we conclude that this potential does not allow any 
bound states. Hence, all possible states are scattering states. This requires that the form of 
the incident beam (i.e. its wavefunction) be known from the experimental setup. In most 
scattering experiments the incident beam has a well-defined momentum i.e. each particle 
in the beam is in a momentum eigenstate. If the beam is assumed to be incident from the 
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left, the corresponding eigenvalue, p, is positive and the energy is 

p 2 

E=t—. (5.59) 

2m 

The position representation of the momentum eigenstate (equation 3.18) would then be 

Ui = Aexp(ipx/H). (5.60) 

However, the hamiltonian of the system can be seen not to commute with the momentum 
operator and hence this initial momentum eigenstate will change. The state that it will 
change into must still be an energy eigenstate of eigenvalue E (equation 5.59) as energy is 
conserved. Hence, the position representation, u. of this solution must still satisfy the time 
independent Schrodinger equation 

Hu = Eu 


or 


h 2 d 2 u 


2 m dx 2 

In the region where x < 0, this would become 


+ Vu = Eu. 


d 2, u 2m E 

dx 2 h 2 


A general solution of equation 5.62 is 


(5.61) 


(5.62) 


u = Aexp(ifoc) + B exp (—Hex) 


(5.63) 
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where A and B are arbitrary constants yet to be determined and 

k = ( 2mE/H 2 ) 1/2 = p/h. (5.64) 

Hence, we see that the first term in the solution is exactly the incident beam given by 
equation 5.60. The additional term is by itself also a momentum eigenfunction, but with a 
momentum equal and opposite to that of the incident beam. Hence, it must be interpretted 
as the beam reflected by the potential barrier. 


u r = B exp(-ikx) = Bexp(—ipx/Ti). 


(5.65) 


In the region where x > a, the potential is again zero and hence the Schrodinger equation 
still has the form given by equation 5.62 and consequently its solution is 

u = C exp(ikx) + D exp(-ikx) . (5.66) 


The first term of this solution is a momentum eigenstate with positive momentum and 
hence must be the transmitted beam. The second term is a momentum eigenstate with 
negative momentum. But that would mean a beam coming in from the far right. As there 
is no such beam, we must set D=0. This leaves us with only the transmitted beam on the 
right side of the barrier i.e. 

u = u t = C' exp(ikx). (5.67) 


The one dimensional probability current would be the x component of S as given in equa¬ 
tion 5.16. 


s — A- 

2m 


T- 


d^* 


ijr* 


dA> 


dx dx 

Hence, from equation 5.60, the magnitude of the incident current is found to be 


(5.68) 


N = 


Hi 

2m 


Ui 


du * 

dx 


u 


. diii 

dx 


= —\A\ 2 , 
m 


(5.69) 


and similarly from u r of equation 5.65 and ut of equation 5.67 we find 


i 7~> 12 , . o 

n r = — B 2 , n t = — C 2 . 
m m 


(5.70) 


Now, from the definitions of the reflection and the transmission coefficients given in equa¬ 
tion 5.22, we get 


\Bl |Cf 

\A\ 2 ’ \A \ 2 ' 


(5.71) 


To find these coefficients we need to find the solution to equation 5.61 in the region where 
0 < x < a. The general solution in this region can be seen to be 


■u = F exp(-Kx) + G exp(Kx) 


( 5 . 72 ) 
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where F and G are arbitrary constants and 

K = [2 m(V 0 - E)/h 2 ]V 2 . (5.73) 

K is real if E < Vo and imaginary if E > Vq. The boundary conditions of theorem 5.2 
applied to the solutions in equations 5.63, 5.67 and 5.72 at the two boundaries x = 0 and 
x = a will give 


A + B 
ikA — ikB 
F exp (—Ka) + G exp(A'a) 
KF exp (—Ka) + KGexp(Ka) 


F + G, (5.74) 

-KF + KG, (5.75) 

Cexp(ika), (5.76) 

ikC exp(ifca). (5-77) 


These are four equations for the five unknown constants. Hence, all five constants cannot 
be determined. However, as seen from equation 5.71, for the observable quantities all we 
need are some ratios of the constants. So we divide the equations 5.74 through 5.77 by A. 
This gives us four equations for the four unknowns B/A, C/A, F/A and G/A. After some 
straightforward but tedious algebraic manipulations we find the two relevant solutions to 
be 


B (A' 2 + k 2 )[l — exp(2Aa)] 

A ( K + ik) 2 — (K — ik ) 2 exp(2A'a) ’ 

C HKk exp[( K — ik)a] 

A ( K + ik) 2 — (K — ik) 2 exp(2 Ka) 


(5.78) 

(5.79) 


If Vo > E then K is real. Then from equation 5.71 and some more tedious algebraic 
manipulations we get 


4E(V 0 - E) 
Vq sinh 2 (Ka) 

Vq sinh 2 (Ka) 
4 E(Vq — E) 


(5.80) 

(5.81) 


For Vq < E, K is imaginary such that j = iK is real. Then R and T would have the forms 


4E{E-V 0 ) 
Vo sin 2 (ja) 

Iff sin 2 {ja) 
4 E{E - Vq) 


(5.82) 

(5.83) 


For all energies it can be verified that 


R + T = 1. 


(5.84) 
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This is due to the conservation of probability. 

The above results are in direct contradiction with classical results. Classically, for 
E < V o, there can be no transmission at all. But here we see that T is nonzero. This effect 
has been called “quantum mechanical tunnelling” for want of a better name 2 . It has been 
observed experimentally in a variety of systems and has also been used in the design of 
electronic devices. Also, classical physics would not allow any reflection if E > Vo. Once 
again the above quantum results show otherwise. 


Problems 


1. Show that equation 5.9 holds for any system with a fixed number of particles if p p is 
the particle density and S p is the particle current density. 

2. Show that for a solution, u, of equation 5.5 to be allowed, it must be finite at all 
points. [Hint: Use results of section 5.1 showing that a solution can go to infinity 
only in a region where E < V and in that case it is disallowed.] 

3. Sketch the energy eigenfunctions for the four lowest eigenvalues of the rectangular 
potential well problem. Assume Vq to be large enough to allow four bound states. 


4. Find the energy eigenvalues and eigenfunctions for an infinitely deep rectangular 
potential well i.e. Vq —> oo. 


5. Find the reflection and transmission coefficients for the scattering states (E > Vq) of 
the rectangular potential well. 


6 . Find the reflection and transmission coefficients for the step potential given by 


V(x) 


where Vq is a positive constant. 


0 for x < 0 
Vq for x > 0 


7. Consider the following one dimensional potential 

( V\ for x < 0 
V(x) = < 0 for 0 < x < a 
[ V 2 for x > a 

where V\ and V 2 are positive and constant and V 2 > V\. For this potential find 

2 Here our classical mindset is seen to hinder the understanding of quantum phenomena. Classically, 
particles must have continuous trajectories. Hence, we understand that if a particle travels from one side of 
a barrier to the other, it must have “tunnelled” through. In other words, for some period of time, it must 
have moved through the barrier. But this is energetically impossible! On the other hand, in a quantum 
theory the concept of a trajectory is nonexistent. So the particle does not actually have to go through the 
barrier to be 011 the other side. In fact, the probability current inside the barrier can be seen to be zero. 
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(a) the conditions for energy eigenstates to be bound or scattering, 

(b) the bound state energy eigenvalues, 

(c) and the reflection and transmission coefficients for the scattering states. 



Chapter 6 


Numerical Techniques in One 
Space Dimension 


For our purposes we shall assume that for any well-defined problem in quantum mechanics 
it is possible to obtain a solution of acceptable accuracy using some computing machine. Of 
course, for some problems the required computing machine might not be available presently. 
Hence, the greatest challenge for the numerical analyst is to find methods that can solve 
practical problems in reasonable amounts of time using present day computers. This has 
resulted in some rather involved numerical techniques. A complete discussion of these 
techniques would divert our attention away from quantum mechanics. Hence, we shall not 
attempt such a task here, hoping to delegate it to trained numerical analysts. However, 
even for the physicist that is going to delegate the task, it is important to be familiar with 
the principles. It helps in two ways. First, it allows physicists to solve simple numerical 
problems on their own. Second, it lets them communicate better with numerical analysts 
while solving physics problems. 

In the following, some simple and intuitive numerical methods will be discussed to 
build the groundwork for future development. Some sample programs written in the C 
language are provided to illustrate the methods (see appendix A). The language C is 
chosen rather than the more popular FORTRAN to provide greater flexibility. Conversions 
to other languages should be straightforward once the material in this chapter is understood. 
Although FORTRAN or BASIC could do the job, structured languages like PASCAL or C 
should be preferred. Some later versions of FORTRAN allow structured programming. But 
they still require programming discipline to avoid unstructured programs. If BASIC is used, 
it should be noted that some of the programs may take several hours on a microcomputer. 
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Figure 6.1: Discretization of a function. 

6.1 Finite differences 


In solving differential equations numerically, it must be noted that the basic concept of 
limits as defined in calculus, needs to be approximated. Infinitesimal quantities need to be 
replaced by small but finite quantities. This would, of course, introduce errors. However, 
such errors can be made indefinitely small by running the computer for a long enough time. 
A little reflection on the nature of the so called analytical solutions shows that even they 
cannot provide exact numerical results for real life applications. For example, the value of a 
trigonometric function of an arbitrary angle can be evaluated only upto a certain accuracy 
in a finite amount of time! 

Approximate forms of derivatives that are needed for numerical computations are 
called FINITE DIFFERENCES. The resulting equivalents of differential equations are 
called DIFFERENCE EQUATIONS. To determine the form of finite differences, we shall 
consider the function f(x) which depends only on x. The values of f(x) are expected to 
be known or computed only at a discrete set of values of x. For simplicity, this set of 
values of x will be assumed to be equally spaced and members of the set will be denoted 
by Xi (i = 0,1, 2,...) (see fig. 6.1). The corresponding values of f(x) will be denoted by _/) 
(i = 0,1, 2,...). The interval between two consequtive values of x* will be given by 

Xi — Xi-I = w. (6.1) 

The value of w, in principle, can be reduced indefinitely to obtain more accurate results. 
However, as w is reduced, computation time increases. Also, a reduction in w must be 
matched with a suitable increase in the number of significant figures to make sure that 
roundoff errors do not spoil the accuracy. 

It can be intuitively seen that the value of the first derivative (the slope) of f(x) at a 
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point Xi can be approximated by the expression 


A Si 


fi+1 fi—1 

%i +1 Xi —i 


fi+1 Ji—l 
2 W 


( 6 . 2 ) 


We shall call this the first difference. To confirm this intuitive result we shall expand /, + 1 
and fi -1 each in a Taylor series as follows. 


fi+1 = + + + + (6.3) 

fi—i = fi-wf' + ^-f' / -^-f'" + ..., (6.4) 

where f[. /" and /"' denote the first three derivatives of f{x) at Xi. Subtracting /$_i from 
/j + 1 and then dividing by 2w gives 

/' = /i+1 2 ~ /i_1 + °(^ 2 ) = A /* + °(^ 2 ) (6-5) 

where 0(w 2 ) has terms of order two or higher in w. For small enough w. 0(w 2 ) can be 
ignored. 


To obtain an intuitive expression for the second difference (an approximation of the 
second derivative), we notice that the first difference at the point halfway between Xi and 
Xi- i is 

fi- 1/2 = (fi fi-i)/w, ( 6 . 6 ) 

and at the point halfway between X {+1 and Xi it is 

fi+ 1/2 = (fi +1 - fi)/w. (6.7) 

The second difference at Xi should then be 

A 2 fi = (fi+ 1/2 - fi-i/2)/w = (f i+ 1 - 2 fi + fi-Q/w 2 . (6.8) 

To verify that equation 6.8 is an approximate form of the second derivative of f(x), we add 
the two expressions in equations 6.3 and 6.4. Then solving for ff would give 

fi = (fi+ 1 - Vi + fi-i)/w 2 + 0(w 2 ) = A 2 fi + 0(w 2 ). (6.9) 

In the following, only second order differential equations will be discussed. Hence, equa¬ 
tions 6.2 and 6.8 will be the only finite differences we will need. 


6.2 One dimensional scattering 


In chapter 5 we discussed one dimensional scattering from simple barriers (rectangular). 
Analytical solutions were possible in these cases. In a general case the form of the wave- 
function is still the same very far to the left and right of the barrier as all physical forces 
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are expected to die out at large distances. Also, in physical problems measurements of 
scattered and reflected waves are made at large distances where the wave is virtually free 
of forces. Hence, in the following, one dimensional space will be broken into three regions 
as follows (fig. 6.2). 

I. Region of incident (and reflected) beam (V = 0). 

II. Region of scattering (V ^ 0). 

III. Region of transmitted beam (V = VQ. 


Region I is force free and hence, has a constant potential. With a suitable choice of reference, 
this potential can be taken to be zero. Region III is also force free. However, as the choice 
of reference has already been made in keeping the potential in region I as zero, the potential 
in region III cannot, in general, be zero. But it will still be a constant V). The region of 
nonzero force (region II) has been chosen to be between x = —a and x = 0. This choice of 
coordinate origin is made for future convenience. For numerical solutions of problems the 
position representation is usually preferred as it leads to differential equations and standard 
numerical techniques for differential equations are well known. Hence, we shall use the same 
method as given in chapter 5 with appropriate changes made for the more general potential. 
The equation to be solved in the three regions is still the time independent Schrodinger 
equation as given in equation 5.61. 


h 2 d 2 u 
2m dx 2 


+ Vu = En. 


From equation 5.63, we already know the solution in region I to be 


( 6 . 10 ) 


u = AexpQkx) + B exp (—Hex), for x < —a. 


( 6 . 11 ) 
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In region III V (= V t ) is a constant and hence equation 6.10 can be written as 

H = (6,12) 

where kt is a constant given by 

k t = [2 m(E - V t )/n 2 ] 1/2 . (6.13) 

Hence, the solution of equation 6.12 would be 

u = C exp(ikt.x) + D exp(—ikt.x), for x > 0. (6.14) 

If E < Vt, then kt is imaginary. This would make the second term in equation 6.14 go 
to infinity at x = +oo. As this is not allowed, D must be zero. Such a solution can be 
seen to lead to a zero transmission probability irrespective of the potential in region II (see 
problem 6 of chapter 5). Hence, for a numerical solution we need to consider only the case 
E > Vt.. So kt must be real and can be chosen to be positive. In chapter 5 it was shown 
that the second term in equation 6.14 would be due to a particle moving backwards in 
region III. Such a particle is not possible as there is no source or reflecting force in region 
III. Hence, D = 0 and 

u = C exp(ik t x), for x > 0. (6.15) 

The solution in region II is to be found numerically for arbitrary potentials. Numerical solu¬ 
tions cannot be written in a general form as an arbitrary linear combination of two linearly 
independent solutions. The freedom of the two arbitrary constants needs to be removed 
through boundary or initial conditions before solving the equation. These conditions must 
come from the known solutions in regions I and III and the continuity conditions at the 
boundaries of the three regions (theorem 5.2). For the computation of the only measurable 
quantities, (the reflection and transmission coefficients) we have seen that only the ratios 
of the constants A, B and C are needed. Hence, without loss of generality, one can set any 
one of these constants to be equal to 1. Depending on which of the constants is chosen 
to be unity, we need different numerical methods. At present we are going to discuss the 
method in which C = 1. This will be later seen to be the simplest. 


As a computing machine, in general, cannot handle complex numbers, one must write 
equation 6.10 as two separate equations for the real and imaginary parts. Separating u in 
its real and imaginary parts one writes 


u = g + ih, 


(6.16) 


where g and h are real. From equation 6.10 one now obtains the two real equations for g 
and h. 


d 2 g 2 m(E — V)g 
dx 2 + h 2 

d 2 h 2 m(E - V)h 
dx 2 h 2 


(6.17) 


0 . 


(6.18) 
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For a numerical solution of these equations it is convenient to choose a dimensionless vari¬ 
able for the independent parameter. Let y be such a variable such that 


x = cy. 

(6.19) 

A choice of c that will simplify equations 6.17 and 6.18 would be 


c = h/(2mE) 1 ^ 2 = 1/k. 

(6.20) 

Then equations 6.17 and 6.18 become 


0 + (i-r/E ) 9 = o. 

(6.21) 

cl 2 h 

54 + (l -V/E)h = 0. 
dy 2 

(6.22) 


To obtain numerical solutions of the above equations, we use the finite difference approxi¬ 
mation for the second derivative as given in equation 6.8 with the Xi replaced by the y,- so 
that w = iji — y*_i. It leads to the following difference equations. 


9i +1 - 2 gi + gt-i + w 2 ( 1 - Vi/E)gi = 0, (6.23) 

hi +1 — 2 hi + hi -1 + w 2 (l — Vi/E)hi = 0, (6.24) 

where Vi is the value of the potential V at y%. These equations are recursion relations for 

g and h. If y,-. hi, g%-\ and hi-\ are given, these equations can be solved to find gi + \ and 

hi+i that is 


9i+ 1 = [w 2 {Vi/E-l)+2]g i -g i _ 1 , (6.25) 

h i+l = [w 2 (V i /E-l) + 2]h i -h i _ 1 . (6.26) 

Hence, if go, ho, yi and h± are known, g and h can be found at all points. These initial 
values can be found from initial conditions of the differential equations, go and ho are the 
initial values of the functions. To determine yi and h\ we notice that approximate forms 
for the derivatives at the initial point are (see problem 1) 

(Ay)o = (gi - 9o)/w, 

(Afr)o = (hi - h 0 )/w 

Then 

yi = w(Ag) 0 + go, (6.29) 

hi = w(Ah)o + ho- (6.30) 

If the initial values of the functions and their derivatives are known, equations 6.29 and 6.30 
would give the approximations for yi and h\. Hence, in this formulation of the problem, 


(6.27) 

(6.28) 
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initial conditions are easier to handle than boundary conditions. A set of boundary condi¬ 
tions would give values of the functions at both ends of region II, but not their derivatives. 
So values for gi and h\ cannot be found directly. 

Initial conditions in region II can be determined from the results of the analytical 
solutions in either region I or region III if we use the conditions of theorem 5.2. However, 
the solution in region I has two unknowns (A and B). One of them can be set to unity, but 
the other still remains unknown. The solution in region III, on the other hand, has only 
one unknown (viz. C ). This can be set equal to 1 as mentioned earlier. 




C= 1. 

(6.31) 

Now equation 6.15 can be written as 




u = exp (ik t x), for x > 0. 

(6.32) 

Then the values of u and its derivative at x = 0 (or y = 0) would be 




ik) = 1, 

(6.33) 

du 

dy 

1 du 
y=0 k dx 

= A = i(l 

X=0 

(6.34) 

Using equation 6.16 this gives 




9o = 1, 

/lQ = 0, 

(6.35) 


= 0, ^ = (1 - Vt/E) 1 / 2 . 

dy v =o dy y=0 v 

(6.36) 


As the region of computation (region II) is to the left of this initial point (y = 0), we 
number the indices of y. g and h in increasing order from right to left such that 


2/o = 0, yi = yi -1 - w. (6.37) 

This makes (A g) 0 and (A/i)o, the approximate forms of the derivatives (equations 6.27 
and 6.28), change in sign. Hence, replacing approximate derivatives by the negatives of 
actual derivatives (equation 6.36), equations 6.29 and 6.30 will give 

9i = l, hi = —w{l — Vt/E) 1 / 2 . (6.38) 

Now, equations 6.35 and 6.38 will provide all the initial values needed to solve the recursion 
relations in equations 6.25 and 6.26. If, in region II, there are n intervals of width w each, 
then the last computed value of gi and hi will be g n and h n respectively. This will complete 
the numerical solution of the differential equation. 

To determine the reflection and transmission coefficients we need to compute the 
constants A and B. Once the numerical solution for u is known in region II, A and B 
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can be found by matching the values of u and its derivative (with respect to y) u y at the 
boundary of region I and region II. In doing this we use equations 6.11, 6.19 and 6.20 to 
get 


u = 

Hexp(iy) + B exp(— iy) 

(6.39) 

Uy = 

iAexp(iy) — iB exp(-iy). 

(6.40) 

This leads to 




| 1 | 2 \Uy+iu\ 2 

(6.41) 


IDI2 \Uy~iu\ 2 

\B\ = 4 ■ 

(6.42) 

Then using equation 6.16 we get 



|H| 2 

(g y - h) 2 + (hy + g) 2 

4 

(6.43) 

\B\ 2 

(g y + h) 2 + (hy - g) 2 

4 

(6.44) 


where g y and h y are the derivatives of g and h with respect to y. Due to theorem 5.2, the 
values of g, g y , h and h y must be the same at x = —a (y = — ka) using either the solution in 
region I or the numerical solution in region II. Hence, the approximate values from region 
II can be introduced in equations 6.43 and 6.44 to abtain 


|H | 2 

\B\ 2 


(dyn h n ) T ( h yn -f- (jii ) 

4 

([hyn T h n ) T ( h yn y n ) 

4 


(6.45) 

(6.46) 


where g n and h n are the last computed values from the recursion relations of equations 6.25 
and 6.26 i.e. they are the values of g and h at x = —a (y = — ka ) which also means nw = ka. 
g yn and h yn are the approximations of the derivatives of g and h at y = —ka. 


9yn — ([In— ] [hi ) l 1 -^ • h yn — (h n —\ h n )/'W. 


(6.47) 


Now, the transmission coefficient T and the reflection coefficient R can be found from the 
definitions in equation 5.22. The currents N and n r are the same as in equations 5.69 
and 5.70. However, as kt is no longer the same as k, nt is given by 



hk t . |2 

n t = - C . 

m 

(6.48) 

fctiq 2 

4(1 - Vt/E) 1 ' 2 

(6.49) 

k\Ap ~ 

(Syn {hyn "t" 9n 

\B\ 2 _ 

pip - 

{dyn hn) + {hy n g n ) 

{dyn h n + {hy n ~\~ Qn)^ 

(6.50) 


Hence, we obtain 
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Thus an algorithm for the computation of transmission and reflection coefficients will con¬ 
tain the following parts. 

• Determine desired values of E , Vt and w. For better results, the function should not 
change much in any interval w. Hence, the wavelength of the wavefunction must be 
large compared to w i.e. w <C 1/k. 

• Set the initial values go, ho, g\ and h\ using equations 6.35 and 6.38. 

• Use the recursion relations in equations 6.25 and 6.26 in a loop to compute all values 
of g t and hi upto g n and h n . In each step of the loop, y is to be decreased by w and 
the value of the potential is to be computed at that value of y. The loop is to be 
terminated when y < —ha. 

• Reflection and transmission coefficients are to be computed using equations 6.47, 6.49 
and 6.50. 


6.3 One dimensional bound state problems 


In bound state problems the energy eigenvalues and eigenfunctions need to be found. A 
rather simple method is possible for potentials that have a reflection symmetry. As will be 
seen in chapter 7, wavefunctions for such potentials are either symmetric or antisymmetric. 
Hence, solving the equation for positive x is sufficient. The antisymmetric wavefunctions 
vanish at the origin and the symmetric wavefunctions have zero slope at the origin. This 
initial condition, along with an arbitrary choice of normalization constant, is sufficient for 
the numerical solution of the equation. As the energy eigenvalue is not known a priori, one 
needs to solve the equation for a series of energy values and observe the behavior of the 
function at some large distance (viz. the “tail”) for each case. If the magnitude of the tail 
increases rapidly with distance, one concludes that the energy is not an energy eigenvalue. 
In the neighborhood of an energy eigenvalue a slow change in the chosen energy will show 
a rapid change in the tail. In fact the tail can be seen to change sign precisely at energies 
that are eigenvalues. This change in sign is so rapid that the solution for the wavefunction 
is extremely unstable and inaccurate at large distances. However, the rapid change in sign 
for small changes in energy can be used to locate the energy eigenvalue very precisely. To 
illustrate this method, we shall use it for the harmonic oscillator problem that has already 
been solved analytically. Hence, it will be a good test for the method. 

As in the scattering case, the numerical solution of bound state problems is also fa¬ 
cilitated by choosing an appropriate dimensionless independent variable. For the harmonic 
oscillator problem, this has already been done in chapter 4. The choice of the indepen¬ 
dent variable y, is given in equations 4.43 and 4.44. The resulting equation to be solved is 
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equation 4.45 which we write here without the subscript E of the function u. 

j + (e - y 2 )u = 0. (6.51) 

The eigenvalue e will now be computed numerically and then the energy eigenvalue E will 
be found using the relation in equation 4.46. Due to theorem 5.1 we know that only real 
eigenfunctions need be found and hence, we choose u to be real. By rising the finite difference 
form of equation 6.8, we can write the difference equation corresponding to equation 6.51 
to be 

Ui +1 = [(y 2 - e)w 2 + 2 ]m - Ui- 1 . (6.52) 

For even solutions, one knows that u is nonzero at the origin. This initial value can be 
arbitrarily chosen to be 1, as the function need be known only upto an undetermined 
multiplicative constant (see rule 2). For the initial condition on the derivative, one can 
write an equation similar to equation 6.29: 

ui = w(Au)o + uo■ (6.53) 

For even solutions, u must have a derivative of zero at the origin. Hence, the approximation 
of the derivative in equation 6.53 can be set to zero. Thus we get the following initial 
conditions. 

Ui = Uo = 1 (even). (6.54) 

For odd solutions the value at the origin must be zero. But the value of the slope at the 
origin can be chosen arbitrarily due to rule 2. We shall choose the approximate value of 
the initial slope to be 1. Hence, the initial conditions are 

uo = 0, u i = w (odd). (6.55) 

Using the conditions of equation 6.54 or 6.55, one can solve the recursion relation of equa¬ 
tion 6.52 using e = 0. The computation should be stopped at some value i = n such that 
u n is very large (say 100). The sign of u n should be noted. Next, the process should be 
repeated several times for values of e incremented by a small amount (say e^) each time. 
If the sign of u n changes between two consequtive value of e (say eo and eo + e^), then, for 
some energy between these two values, the wavefunction must go to zero at large distances 
- This energy would be the lowest eigenvalue. If is chosen to be smaller than tolerable 
error in eigenvalue computation, then eo itself can be accepted as an approximation of the 
lowest eigenvalue. If the above process is continued for higher energies, higher eigenvalues 
can be obtained. 

Very often the that is needed for the desired accuracy is so small that computation 
time becomes unacceptably large. Hence, the search for eigenvalues needs to be done in a 
more sophisticated manner. A binary search would be faster. However, to conduct such a 
search, one would need to know intervals in e that have one and only one eigenvalue each. 
Thus a practical approach would be to conduct a rough linear search as before with an 
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just small enough such that the interval [e, e + ej can have at most one eigenvalue. Once 
the interval in which an eigenvalue is located is found, a binary search within the interval 
can be conducted. This search technique is similar to the bisection method outlined in 
chapter 5 for the solution of equations 5.54 and 5.55 (see problem 2). 

As one knows that only the eigenvalue is the observed quantity, the above numerical 
technique is usually quite sufficient. However, very often, to estimate the changes in an 
eigenvalue due to small changes in the potential energy function, one needs the wavefunc- 
tion. The above method gives the correct wavefunction except for the unstable tail. The 
tail region of the solution is easily identified by a catastrophic increase (or decrease) in the 
computed function. Setting this part of the solution to be identically zero gives a reason¬ 
ably good approximation for the wavefunction. However, for better results one may use a 
matrix method (see problem 3) to compute the function in the tail region. To do this one 
may assign the boundary condition of the function at a large distance to be zero. The other 
condition is that of continuity with the solution already obtained before the tail region (see 
problem 5). The matrix method could, of course, have been used for the complete solution 
rather than just the tail. But that would require a larger computer memory. 

If the potential function V(x) is not symmetric, finding bound state energy eigenvalues 
can become more tricky. In such a situation the matrix method might be easier to use. 
The value u( 0 ) (= uq) of the function at the origin can be chosen to be 1 at the origin. 
Then for some energy E , a matrix solution can be found from x = — oo to x = 0 using the 
boundary conditions u(— oo) = 0 and u( 0) = 1 . Here infinity is understood to be a large 
enough value for computer usage. Next, for the same energy E, another matrix solution 
can be found from x = 0 to x = +oo. The derivatives of the two solutions must match 
at x = 0. This matching condition can be used to search for the eigenvalues of energy. If 
by accident the origin is chosen at a point where the wavefunction vanishes, a shift in the 
coordinate system would be necessary. 


6.4 Other techniques 


The numerical methods discussed in this chapter are some of the more intuitive and theoret¬ 
ically straightforward ones that are known. They are to be considered as only a beginning. 
In many situations the physicist needs to develop more specialized and sometimes more 
involved methods to suit the needs. Numerical tricks, that improve accuracy and speed in 
special cases, are continually being developed by numerical analysts. Hence, the reader is 
also encouraged to develop his (or her) own methods whenever the need arises. 



CHAPTER 6. NUMERICAL TECHNIQUES IN ONE SPACE DIMENSION 


75 


6.5 Accuracy 


In any numerical technique it is crucial to know the degree of accuracy. Without an 
error estimate the numerical results are quite worthless. One of the simplest and practical 
methods of error estimation involves computing the change in the final results due to a 
change in the interval w used in the solution of the differential equation. The number 
of significant figures of the solution that do not change due to a decrease in w, give the 
accuracy of the solution. 

The approximation methods used in this chapter are based on expanding a function 
in a Taylor series and selecting a suitable number of terms from it. The error introduced 
by ignoring higher order terms in the Taylor series is called a truncation error. Truncation 
errors can usually be reduced by choosing a small enough interval w. However, very small 
intervals can increase what are called roundoff errors. The source of roundoff errors is the 
following. 

A machine computation is usually done with a certain fixed number of significant 
figures. When the interval w is chosen to be small, it very often requires the computation 
of the small difference between two large numbers (e.g. a first difference). The number 
of significant figures of the difference can be seen to be much smaller than that of the 
original numbers even though the computer will arbitrarily fill in the missing significant 
figures. For example, the two numbers, 4.02395 and 4.02392, have six significant figures 
each. However, their difference, 3 x 10~ 5 , has only one significant figure. The computer will 
fill in significant figures and might consider this difference to be 3.00000 x 10 -5 . The error 
introduced by arbitrarily assigning these extra significant figures is called a roundoff error. 
Roundoff errors can be reduced by choosing a larger number of significant figures for all 
computations and thus paying the price through longer computation times. Most computer 
languages provide at least two different choices of significant figures (viz. single or double 
precision). Higher precision computation can be obtained by some custom programming 
for the underlying arithmetic operations. 

Truncation errors can also be reduced by using higher order algorithms that use a 
larger number of terms of the Taylor series expansion. Such methods are beyond the scope 
of this book. 


6.6 Speed 


Computation speeds can of course be improved by better computer hardware. Such im¬ 
provements are limited only by technology and the financial status of the physicist. On the 
other hand improving speed through efficient software is an art that is often learnt from 
experience. In any effort in machine computation one needs to strike the right balance in 



CHAPTER 6. NUMERICAL TECHNIQUES IN ONE SPACE DIMENSION 


76 


accuracy, computation speed, and time (or money) spent on software. While discussing the 
accuracy of computations we have already shown that, in general, an increase in accuracy 
is accompanied by a decrease in speed. 


Problems 

1. Show that equations 6.27 and 6.28 give approximate forms of the derivatives of g and 
h at the origin. 

2. Based on the bisection method outlined in chapter 5 for the solution of equation 5.54, 
describe a method for the binary search of an eigenvalue (for a bound state) when an 
interval containing one and only one eigenvalue has already been located. 

3. Show that recursion relations like those of equations 6.25, 6.26 and 6.52 can be written 
as n — 1 linear algebraic equations for n + 1 unknowns. This set of equations can 
be solved using numerical matrix methods if two more independent equations are 
included. Show the following: 

(a) Initial conditions can provide these two equations. 

(b) Boundary conditions on both ends can also provide these two equations. 

4. Show that the one dimensional scattering problem can be solved by the matrix method 
of problem 3, if in equation 6.11 A is chosen to be 1 while B and C (of equation 6.15) 
are computed from the solution. For such a choice, find the four extra equations (two 
for the g recursion relation and two for the h recursion relation) needed to solve the 
matrix equations. 

5. Find a numerical method based on the matrix method of problem 3 for the computa¬ 
tion of the tail part of a bound state eigenfunction. Use the boundary condition that 
the function goes to zero at large distances. 

6. Develop a computer algorithm to compute the reflection and transmission coefficients 
for the junction potential of a semiconductor p-n junction. Such a potential is given 
by the following. 


V = 0 x < —d p 

= K p (x + dp) 2 — d p < x < 0 
= V t - K n (x - d n ) 2 0 < x < d n 
— Vt x d n . 

where d V) d n , K p and K n are given constants characterizing the potential and Vt = 
Kpdp + K n d n . 
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7. Develop a computer algorithm to compute the eigenvalues for a quartic potential: 

V = Kx 4 . 


where K is a given constant. 



Chapter 7 


Symmetries and Conserved 
Quantities 


7.1 Symmetry groups and their representation 


A general transformation of a system can be visualized as a coordinate transformation in 
some arbitrary coordinate system. A symmetry transformation is a transformation that 
keeps the physical characteristics of the system unchanged (for example, a rotation of a 
spherical object). In classical mechanics a symmetry transformation is defined as follows. 

Definition 28 A CLASSICAL SYMMETRY TRANSFORMATION of a system keeps the 
form of the hamiltonian unchanged. 

A quantum symmetry transformation can be defined to be the same. However, one knows 
that the quantum hamiltonian ( H ), like any operator, is specified completely by its oper¬ 
ations on all possible states. As the set of eigenstates of the hamiltonian form a complete 
set, it would then suffice to specify the operation of H on all its eigenstates. This would 
of course amount to specifying all eigenvalues of H. Hence, for convenience, the following 
alternative definition will be used for quantum mechanics. 

Definition 29 A QUANTUM SYMMETRY TRANSFORMATION keeps the set of all 
eigenvalues and eigenstates of the hamiltonian unchanged. (Note: degenerate states with 
the same eigenvalue can exchange places in such a transformation.) 

In both classical and quantum mechanics, it can be seen that symmetry transformations 
become important due to their relation to conserved quantities. However, in quantum 
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mechanics the imporatance of symmetries is further enhanced by the fact that observation 
of conserved quantities can be exactly predictable in spite of the probabilistic nature of 
quantum predictions (see chapter 4). Hence, in this chapter, we shall study the nature of 
symmetry transformations in quantum mechanics. 

Let us consider an arbitrary transformation of an arbitrary state |s) to be given by 
the operator U such that the transformation gives 

I s)^U\s). (7.1) 

If U were to produce a symmetry transformation, the following theorem can be proved. 

Theorem 7.1 If the operator U produces a symmetry transformation on all ket vectors, 
then it must commute with the hamiltonian. 


Proof: By definition of a symmetry transformation, the operator U could transform an 
energy eigenstate either to itself or another eigenstate degenerate to it. Hence, if | Ef) 
is an eigenstate of H with eigenvalue £) then 

HU\Ei) = H\E’) = Ei\E') 

= E i U\E i ) =UE i \E i ) =UH\Ei) (7.2) 

where \Ef) and \E[) are either degenerate or the same. This gives the result 

[H,U]\Ei) = 0. (7.3) 

The above equation is true for all energy eigenstates | Ef). From the completeness 

theorem one knows that any arbitrary state |s) can be written as a linear combination 
of the eigenstates \Ef). Hence, it follows that 

[H,U]\s) = 0 - ( 7 - 4 ) 

As |s) is an arbitrary ket vector, one concludes 

[H,U} = 0. (7.5) 


This proves the theorem. 


The following definitions are going to be useful in the future. 

Definition 30 An operator U is called UNITARY if 

U^U = I 


where I is the identity operator defined in chapter 1. 
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Definition 31 An operator Q is called ANTILINEAR if 

Q{a\r) + b\s)) = a*Q\r) + b*Q\s) 

where a,b E C. 

Definition 32 An operator U is called ANTIUNITARY if it is antilinear and 

(r\U^U\s) = (r|s)* = (s|r) 


Now the following theorem can also be proved for symmetry transformations. 

Theorem 7.2 If a linear operator U produces a symmetry transformation then it is uni¬ 
tary. 


Proof: Let the set of eigenstates of the hamiltonian H be {|£7*)} and let the operation of 
U on this set be given by 

U\Ei) = | E[). (7.6) 

By the definition of a symmetry transformation the set {| A')} is also that of eigen¬ 
states of the hamiltonian. They can also be chosen to be normalized. Consider two 
arbitrary states |r) and |s). They can be written as linear combinations of the energy 
eigenstates | Ef). 



k> = 51 ai 

a 

1 Ei) 

(7.7) 


1 s) = 

i 

(7.8) 

Then, as U is linear, equation 

7.6 gives 



U\r) 

= E i aiU\Ei) 

=E“.i £ :> 

(7.9) 

U\s) 

= E j b j U\E j ) 

j 

(7.10) 

Hence, 

(r\U*U\s) = 

= J2 a i b AE' \E$ 

= J2 a i b i = ( r l S )‘ 

(7.11) 


ij i 


Here the orthonormality of energy eigenstates is used. As equation 7.11 is true for 
any two states |r) and |s), it follows that WU is the identity operator I. 

U ] U = I. (7.12) 


This completes the proof. 
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A more general form of theorem 7.2 can also be proved. However, here we shall state it 
without proof as follows. 

Theorem 7.3 An operator U that produces a symmetry transformation must be either 
unitary or antiunitary. 

We shall later see that the time reversal symmetry operator is an example of an antiunitary 
operator. 

The set of all symmetry transformations of a system is called its symmetry group as 
it satisfies the mathematical properties of a special type of set called a group. A group is 
defined as follows: 

Definition 33 If G is a GROUP and A , B, C £ G, then the following are true: 

1. The product AB £ G if the product is defined as two successive operations. 

2. The identity transformation I £ G where I is defined by AI = IA = A. 

3. Every A (A G) has a unique inverse A(€ G) such that AA~ l = A -1 A = I. 

4- The product is associative i.e. A(BC ) = {AB)C. 

Definition 34 A CONTINUOUS GROUP (e.g. rotation group) is one whose elements 
have a one-to-one correspondence with the values of a set of continuous variables called the 
GROUP PARAMETERS (e.g. angle of rotation). 


Definition 35 If the algebra of a group can be realized by a set of operators, then this set 
is called its REPRESENTATION. 


Definition 36 The operators on the quantum states of a system, that generate transfor¬ 
mations of these states according to the group elements of the symmetry of the system, 
must form a representation of that group. This will be called the QUANTUM STATES 
REPRESENTATION of the group. 


If 6 is a group parameter, a group element (in its quantum state representation) infinitesi¬ 
mally different from the identity can be written as 

U(dB) = I + QdB , (7.13) 

where Q is an operator. From theorem 7.3 we know that U(dB ) could be either unitary or 
antiunitary. If it were antiunitary, it would have to be so even in the limit of cl6 going to zero. 
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But from equation 7.13, we see that for cW = 0, U{d6 ) = I which is not antiunitary. Hence, 
we conclude that U(dO) must be unitary and in general the quantum states representation 
of any continuous group must be unitary as a continuous variation of the group parameters 
cannot bring about the discontinuous change from unitary to antiunitary. Unitarity of 
U{d6) gives 

I = U\d6)U{dO) = (/ + Q ] d6)(I + QdO) /-((/ + Q)dB, (7.14) 

where second order terms in the infinitesimal d8 are dropped. Hence, 

Q ] = -Q. (7.15) 

If Q = iJ then equation 7.15 would give J to be hermitian, and 

U(d0) = I + iJd0. (7.16) 

To obtain a finite group operation one may operate on a state n times with U ( d6 ) and then 
let n —> oo, such that ndO = 8 is finite. This gives 

U{8) = lim (/ + iJ8/n) n = exp (iJd), (7-17) 

n—xx) 

where J, the hermitian operator, is called a GENERATOR of the group. Very often J can 
be found to be an observable. 

Corollary 7.1 From theorem 7.1 mid theorem ^.2, it follows that a generator of a symme¬ 
try group must be a conserved quantity (see problem 1). 


In chapter 4 we had shown that conserved quantities are particularly important for quan¬ 
tum mechanics because they are the only quantities whose measurements can be predicted 
precisely in the absence of experimental error. Corollary 7.1 gives a way of identifying these 
conserved quantities through the symmetries of the system. Hence, it becomes important 
to study the symmetries of a system. In the following sections we shall investigate some 
common symmetries and the corresponding conserved quantities. 


7.2 Space translation symmetry 


A system that appears the same from different points in space is said to have space trans¬ 
lation symmetry or just TRANSLATION SYMMETRY. Let U s ( q) denote the quantum 
states representation of the translation group for a translation in space by the displacement 
vector q. The operation of this symmetry operator on an arbitrary state |s) can be best 
understood in its position representation ip s (r). First, considering a one dimensional system 
we give an infinitesimal translation of dq to the one dimensional wavefunction ip s (%) = (a;|s). 
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The result of the translation should make the new function at x be equal in value to the 
old function at x — clq i.e. 

(x\U s (dq)\s) = (x — clq\s) = ip s (x — clq ) 

= ip s (x) - = (l ~dq^ji>s{x). (7.18) 

Comparing with equation 7.16 we see the generator for translation in one dimension to be 
proportional to the momentum operator P (= — ihd/dx in the position representation) i.e. 

(x\U s (dq)\s) = (1 -iPdq/h)ip s (x). (7.19) 

For a finite translation of q we use equation 7.17 to obtain 

(x\U s (q)\s) = exp (-iPq/h)ijj s (x). (7.20) 

Hence, in general, the translation operation in one dimension is represented by 

U s (q) =exp(-iPq/h). (7.21) 

This result can be generalized for the three dimensional case to give 

U s ( q) = exp(—iP • q/h), (7.22) 

where P is the momentum vector operator. Now, from corollary 7.1 we can at once conclude 
that in a translationally symmetric system the momentum is conserved. The free particle 
is an example of such a system. 


7.3 Time translation symmetry 


The Schrodinger equation is postulated to generate time translation. Hence, the quantum 
states representation of the time translation operator must be directly derivable from the 
Schrodinger equation. This has been done in problem 2 of chapter 4. Thus we find the 
time translation operator to be 


U t (t) = exp (—iHt/h). (7.23) 

This can be seen to be a symmetry operation for all conservative systems as in chapter 4 
we showed that, for such systems, energy eigenstates do not change with time. 
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7.4 Rotation symmetry 


Under a rotation the transformation of the rectangular coordinates (aq,® 2 >® 3 ) = ( x,y,z ) 
is given by a matrix with elements a,ij(i,j = 1, 2, 3) such that the transformed coordinates 
04,a44) = are given by [2] 

x 'j -- •’ y i CLijXj , i 1.2.3. (7.24) 

i 

The scalar product of two vectors is unchanged by this transformation. This leads to the 
condition 

= ^ ' CLjjO'kj = (7.25) 

i i 

where <5^ is the Kronecker delta. The set of all 3 x 3 matrices that satisfy equation 7.25 
form the rotation group. This group is also known as the orthogonal group in 3 dimensions 
or 0(3). From equation 7.25 it follows that the matrices cijj have the determinant +1 or 
—1. The subset of these matrices that have the determinant +1 is also a group and it 
is called the 50(3) group. 50(3) contains all elements of 0(3) that can be continuously 
transformed to the identity i.e. it does not include coordinate inversions. At present we are 
going to discuss only 50(3) as it has no discrete transformation elements and hence, can 
be represented in the form of equation 7.17. 

A rotation about the z direction by an angle # can be seen to be given by the following 
element of the rotation group. 

( cos 9 — sin 9 0 \ 

sin# cos# 0 I . (7.26) 

0 0 1 ) 


Hence, an infinitesimal rotation about the z direction by an angle d9 is given by 


a z (dO) 


1 -dO 0 \ 
d9 1 0 

0 0 1 / 


(7.27) 


Now we can derive the quantum states representation Ur of 50(3). Under an infinitesimal 
rotation of d9, about the z axis, the position representation ip s ( r) of the state |s) becomes 


(r|f7ji(d#)|s) = ip s ( r'), 


(7.28) 


where r' = a z 1 (c/#)r. Hence, 


Vt/r') =ifj s (x + yd9, y - xd9 , z). 


(7.29) 
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A Taylor series expansion up to first, order terms in cl6 would give 


= V>s(r) + ydO dlJ g^ - - nW >>l 


(7.30) 


In the position representation the quantity in parenthesis can be seen to be 

( d d \ 

\ y d- X - X d-y) = 

— iC z /Ti, 


(7.31) 


where P x and P y are the respective x and y components of momentum and L z is the z 
component of angular momentum (in its operator form in the position representation). 
Hence, we conclude that the operator form of Ur(c10) in any representation is 

U R (dO) = 1- iL z cl6/h. (7.32) 

The corresponding finite rotation by an angle 6 about the z axis is 

U R (9) = eM-iL z 0/h). (7.33) 

This can be generalized, as follows, for any rotation 6 about some direction given by the 
unit vector n. 

Ur{9) = exp (—Hi ■ h0/h). (7.34) 

Thus we see that angular momentum is the generator of rotation i.e. SO( 3). Hence, it must 
be conserved in a system that is rotationally symmetric. 


7.4.1 Eigenvalues of angular momentum 

As L z is conserved in a spherically symmetric system it will have simultaneous eigenstates 
with the hamiltonian (see problem 3 in chapter 4). Similarly, L x , or L y could also individ¬ 
ually share eigenstates with the hamiltonian. However, different components of L cannot 
share eigenstates as they do not commute. For example, if the position and momentum 
operators are R = (X, Y. Z ) and P = ( P x , P y , P ~) then 


[L Xl Ly] — [YP Z — ZP y ,ZP x — XP Z ] 
= Y[P Z , Z}P X + X[Z, P z \P y 
= ih(XP y -YP x ) 

= ihL z . 


(7.35) 
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Similarly, 


[L y ,L z \ = i%L x , (7.36) 

[L Z ,L X \ = ihL y . (7.37) 

The magnitude squared of the total angular momentum, L 2 (= L 2 + L 2 + L 2 ) commutes 
with each component. Hence, we can find simultaneous eigenstates for any one of the 
following three sets of operators: {H, L 2 , L x }, {H, L 2 , L y }, {H, L 2 , L~). In the following, 
without loss of generality, we shall choose the simultaneous eigenstates of {H, L 2 , L~}. 
These eigenstates, |c, d), are labelled by c, the eigenvalue of L 2 , and d, the eigenvalue of L z . 
The energy eigenvalue is suppressed as it will have a fixed value for the following discussion. 
Hence, 

L 2 \c,d) = c\c,d), L z \c,d) = d\c,d) (7.38) 

We can now find the possible values of the eigenvalues c and d. To this end we first define 
the operators 


T+ — L x iLy , (7.39) 

L_ = L x — iLy. (7.40) 

The commutators of L + and L_ can be obtained from equations 7.35, 7.36 and 7.37 to be 


[L+, L_] = 2 hL z , (7.41) 

[L+,L Z \ = -hL+, (7.42) 

[L-,L Z ] = UL-. (7.43) 

Then, 

L~L + \c,d) = L + L~\c, d) + hL + \c, d) 

= L + d\c, d) + hL + \c, d) 

= (d + h)L + \c, d). (7.44) 

As L 2 commutes with all angular momentum components we can also see that 

L 2 L + \c,d) = cL + \c,d). (7.45) 


Hence, it is seen that the operator L + “raises” the eigenstate |c, d) to another eigenstate 
with the L z eigenvalue greater by h while the L 2 eigenvalue remains the same. Thus one 
can write 

L + |c, d) = N d \c, d + h), (7.46) 

where Nd is chosen to maintain normalization of the eigenstates. Similarly, it can be shown 
that 


L_|c, d) = Md\c, d — h). 


(7.47) 
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For a given eigenvalue of L 2 , the eigenvalue of L z is expected to have both an upper limit 
and a lower limit. This can be seen from the fact that 

L_L_i_|c, d) = (L 2 + Ly + i[L x , Lj,])|c, d). (7.48) 

From equations 7.35, 7.46 and 7.47 this gives 

M d+h N d \c, d) = (L 2 X + Ly - HL z )\c, d) 

= (L 2 — L 2 Z — hL z )\c, d) 

= (c — d 2 — hd)\c, d). (7.49) 

We also know that (c, d|L_ is the adjoint of L + \c,d). Hence, 

\N d \ 2 = (c, d\L_L + \c, d) = M d+h N d , (7.50) 

and thus 

M d+h = N* d . (7.51) 

Then from equation 7.49 we see that 

c — d 2 — hd = \N d \ 2 > 0, (7.52) 

or 

d 2 + hd < c. (7.53) 

This shows that for a given c, d has a maximum and a minimum value. Let the maximum 

value for d be lh where l is dimensionless. Then 

L + \ Cl lh) = 0, (7.54) 


and from equation 7.49 we get 

0 = L_L + |c, lh) = (c — l 2 h 2 — lh 2 )\c, lh), (7.55) 

or 

c=l(l + l)h 2 . (7.56) 

As the eigenvalue of L~ has a minimum value for a given c, it is evident that a finite number 
of operations on |c, lh) by L_ should bring it down to the eigenstate of that minimum 
eigenvalue. If this finite number is n (a non-negative integer) then 

L_\c,lh — nh) = 0. (7.57) 


0 = L + L_ |c, (l — n)h) = (L 2 + L 2 -i[L x ,L y ])\c,(l - n)h) 

= (L 2 -L 2 z + hL z )\c,(l-n)h) 

= [c — (l — n) 2 h 2 T {I — n)h 2 ]\c, (l — n)h). (7.58) 


Hence 
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It then follows that 

c — (Z — ri){l — n — l)h 2 = 0. (7.59) 

Using equations 7.56 and 7.59 we obtain 

l (l + 1) — (l — n)(l n — 1) = 0, (7.60) 

or (as n is non-negative) 

l = n/2. (7.61) 

Hence, l can take either integer or half integer values and the eigenvalue of L 2 corresponding 
to it is l (l + l)h 2 . If the eigenvalue of L z is written as mh, m can take a maximum value 
of l. All other value of m differ from l by some integer. The minimum value of m is l — n 
which, from equation 7.61, can be seen to be —l. Now we can label the angular momentum 
eigenstates by the numbers l and m (instead of c and d) i.e | l,m) such that 

L 2 |/,m) = l(l + l)h 2 \l 1 m), (7.62) 

and 

L z \l, m) = mh\l, m), (7.63) 

where l is either an integer or a half integer and m takes the values l, l — 1, l — 2,..., —l. 
Hence, the total number of m values for a given l is 21 + 1. 

Thus we have shown that angular momentum eigenvalues are discrete. However, these 
results are obtained only from the commutators of the generators and further restrictions 
may apply on the allowed eigenvalues when finite rotations are considered. From equa¬ 
tion 7.33 a rotation of 27 t on a state | l,m) would give 

exp(— 2iriL z /h)\l, m) = exp(—27rz?n)|/, m). (7.64) 

If l (and consequently m) is a half integer this does not produce the expected identity 
transformation. Hence, half integer l is not allowed for the kind of state vectors we have 
discussed so far. It will later be seen that state vectors that include spin will allow half 
integer l values. 

Two relations that will be seen to be useful later are as follows. 

L + |/,m) = [1(1 + 1) — m(m + l)\ l ^ 2 U\l, m + 1) (7.65) 

L_\l,m) = [1(1 + 1) — m(m - l)\ l ^ 2 U\l, m — 1) (7.66) 

These can be obtained from equations 7.49, 7.50, 7.56 and 7.63 (see problem 3). 
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7.4.2 Addition of angular momenta 

If a system has two different components (e.g. a two particle system), each of which 
has a measurable angular momentum, the relation between individual component angular 
momenta and total angular momentum is nontrivial in quantum mechanics. Hence, we 
shall discuss it here. Let the individual component angular momenta be Li and L 2 and 
the total angular momentum be L such that 



L — Li + L 2 . 

(7.67) 

L+ 

= L\ + + L 2+ , 

(7.68) 

L_ 

= Li_ + L 2 — , 

(7.69) 

L z 

= L\ z + L 2z , 

(7.70) 

L 2 

= L\ + Li + 2Li • L 2 , 

(7.71) 


where the subscripts +, —, and 2 mean the same for the vectors Li and L 2 as they do for L. 
If the eigenstates of Li and L\ z are labelled as |Zi,mi) and that of L\ and L 2z as 1 l 2 , mn 2 ), 
then the combined system can be represented by the direct product \li,mi) <8> \l 2 , m - 2 ) of 
these states. A more compact notation for these direct products would be 

\h,mi) <g> \l 2 ,m 2 ) = \h,l 2 ,m u m 2 ) = |mi,m 2 ). (7.72) 

In the last form l\ and 1 2 are suppressed. This is convenient when the values of l\ and l 2 
are fixed for some computation. It is possible to choose l±, mi, l 2 , and m 2 as labels as the 
corresponding operators L\ z , L 2 , and L 2z commute with each other and hence have 
simultaneous eigenstates. Another set of commuting operators is {L \, L|, L 2 , L~}. Hence, 
a set of simultaneous eigenstates for these operators can be found. These eigenstates would 


be labelled as \l\,l 2 ,l,m) such that 

L\\lil 2 lm) = h(h + l)h 2 \lil 2 lm), (7.73) 

L%\lil 2 lm) = l 2 (l 2 + l)h 2 \hl 2 lm), (7.74) 

L 2 \lil 2 lm) = l{l + l)h 2 \lil 2 lm), (7.75) 

L z \lil 2 lm) = mh\lil 2 lm), (7.76) 


where the commas within the ket are omitted. We shall call the \l\hrn\rn 2 ) states of 
equation 7.72, the individual angular momenta states and the \lil 2 lm) states the total 
angular momentum states. As these two different sets of eigenstates describe the same 
system, they must be related as linear combinations of each other. The coefficients for such 
linear combinations are called the CLEBSCH-GORDAN COEFFICIENTS. If the common 
labels (Zi and l 2 ) of the two sets are suppressed, then one may write 

| Im) = ^2 \mim 2 )(mim 2 \lm) (7.77) 

mi m2 
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as Emim 2 \m 1 m 2 ){mirri 2 \ would be the identity in the subspace where l\ and 1 2 are fixed. 
From equation 7.77 we see that the coefficients (mim 2 \lm) are the Clebsch-Gordan coeffi¬ 
cients. 

We shall discuss some useful general results before computing these coefficients. Op¬ 
erating equation 7.77 by L z (= L\ z + L 22 ) and then multiplying from the left by ( 77 + 7712 ! 
one obtains 

m{mim2\lm) = (mi + m2)(mim2|Zm). (7.78) 

Hence, the coefficient (mim 2 \lm) can be nonzero only if 

771 = 7711 + 7712. (7.79) 

So for fixed values of h and I 2 , the largest value of m can be l\ + h- Also, as l is the largest 
value of m, the largest value of l is l± + 12 ■ Hence, for l = m = l± + I 2 , equation 7.77 would 
become 

l^i + hi h + h) = \hh)(hh\h + hi h + h)- (7.80) 

As all eigenstates are normalized, one could choose (hh\h + hih + h) = 1- This would 
give 

\h + hih + h) = \Wh)- (7.81) 

In the last two equations, one notices a possibility of confusion in notation of the individual 
angular momenta states and the total angular momentum states. To avoid such confusion, 
we shall always have the individual angular momenta kets on the right side of an equation 
unless they are part of an inner product. Similarly, the total angular momentum kets will 
always be placed on the left side of an equation. From equation 7.71 it can be seen that l 
can take different values for fixed values of l± and h • However, as l is the maximum value of 
m (which is mi + m 2 ), it can be less than l\ + h only by a positive integer. The minimum 
value of l is |£i — h |- This can be seen from the fact that the number of eigenstates for the 
individual angular momenta must be the same as that for the total angular momentum (for 
fixed h and h) as both sets of eigenstates form a complete set spanning the same space 
(see problem 5). We can now illustrate a general method of computing Clebsch-Gordan 
coefficients through an example. If l\ = 1, and h = 1/2, then from equation 7.81 we get 

|3/2,3/2) = |1,1/2). (7.82) 

Multiplying this by (= Li_ + L 2 -) gives 

L_|3/2,3/2) = Li_|1,1/2)+L 2 _|1,1/2). (7.83) 

Using equation 7.66, for both individual angular momenta and total angular momentum, 
one obtains 

V3|3/2,1/2) = V2|0, 1/2) + |1, -1/2), (7.84) 

13/2,1/2) = ^2/3(0,1/2) + /L/3|l, -1/2). 


or 


(7.85) 
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This gives the following Clebsch-Gordan coefficients. 

(0,1/213/2,1/2) = y / 2/3, (7.86) 

(1, —1/2|3/2,1/2) = yi/3. (7.87) 

Operating on equation 7.85 by once again would give 

213/2, —1/2) = y^73v^|-l,l/2) + ^/3|0,-l/2) 

+y / l/3\/2|0,-l/2) + 0, (7.88) 

or 

|3/2, -1/2) = yi/31 - 1,1/2) + y^2/3|0, -1/2). (7.89) 

Operating again by gives 

|3/2, —3/2) = | — 1, —1/2). (7.90) 

The next possible value for l is (3/2 — 1) = 1/2. In this case the state of highest possible 
m is 11/2,1/2). Due to equation 7.79, this state could be written as the linear combination 

11/2,1/2) = a|0,1/2) + 6|1, —1/2). (7.91) 

This state must be orthonormal to all other total angular momentum states and in partic¬ 
ular to 13/2,1/2) as given by equation 7.85. Hence, we obtain 

11/2,1/2) = yi/3|0,1/2) - ^2/3|l, -1/2). (7.92) 

Operating this by gives 

11/2, -1/2) = yj2/Z\ - 1,1/2) - yi/3|0, -1/2). (7.93) 

This completes the computation. The Clebsch-Gordan coefficients for l\ = 1 and p = 1/2 
as obtained in the equations 7.82, 7.85, 7.89, 7.90, 7.92 and 7.93, can now be summarized 
in the following chart. 


l 

m 

3/2 3/2 1/2 3/2 1/2 3/2 

3/2 1/2 1/2 -1/2 -1/2 -3/2 

mi m 2 

1 1/2 

1 

1 -1/2 

Ico 

|cN 

1 

Ico 

[t-H 

0 1/2 

Ico 

[t—1 

Ico 

[cN 

0 -1/2 

V^/3 — n/173 

-1 1/2 

s/TJS V^/3 

-1 -1/2 

1 


(7.94) 
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7.5 Discrete symmetries 


A discrete subgroup of a continuous symmetry group can always be defined by choosing 
the group parameter at periodic intervals (see problem 7). However, here we are going 
to discuss some discrete symmetries that are not subgroups of continuous groups. Such 
symmetries are not associated to any conserved quantities, as they have no generators. 
The following two discrete symmetries are of general importance in physics. 

7.5.1 Space inversion 

The space inversion operator, I s , has the following operation on the position vector r. 

I s r = —r. (7.95) 

The quantum states representation of I s will be called Ui and for an arbitrary state |s) 

(r|t/j|s) = j(—r\s) = j^ s (-r). (7.96) 

The extra factor j is needed due to the discreteness of the symmetry. In continuous sym¬ 
metry operations the value of j is unity as in the limit of all symmetry parameters going to 
zero the wavefunction must stay unchanged. For a discrete symmetry such a limit cannot 
be defined. For the spinless particles that we have discussed till now, two space inversions 
should produce the original wavefunction i.e. 


Uf\s) = s), for any s). 

(7.97) 

Hence, from equation 7.96 we get 


-H 

II 

r-H 

II 

CM 

(7.98) 


The value of j is called the INTRINSIC PARITY of the system. 


Theorem 7.4 The energy eigenstates of an inversion symmetric system can be chosen 
such that they change at most by a sign under the inversion operation. 

Proof: If | E) is an eigenstate of energy with eigenvalue E, then from the definition of a 
quantum symmetry Uj\E) is also an eigenstate with the same eigenvalue. Hence, the 
following are also eigenstates of energy with the same eigenvalue. 

|£i) = | E) + Uj\E), \E 2 ) = | E) - U:\E). (7.99) 

From equations 7.97 and 7.99 it can be seen that 

U^Ef) = +| Ei), U!\E 2 ) = -| E 2 ). (7.100) 


This proves the theorem. 
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Definition 37 The energy eigenstates of the type \Ei) in equation 7.100 are called SYM¬ 
METRIC and they are also said to have POSITIVE (TOTAL) PARITY. The eigenstates 
of the type \E 2 ) in equation 7.100 are called ANTISYMMETRIC and they are also said to 
have NEGATIVE (TOTAL) PARITY. 

It should be noted that the intrinsic parity is included in the total parity. However, the 
intrinsic parity of particles cannot be absolutely determined. The intrinsic parities of some 
particles have to be assumed and then those of others can be determined if the system is 
inversion symmetric (i.e. total parity is conserved). 

From equation 7.97 we see that Uj = Uj and hence, in the position representation 
(using equations 7.96 and 7.98) 

UjrUiip s (r) = Ujrjip s (-r) = -ri) s (r). (7.101) 

As this is true for any state ip s (r), the following must be true for the position operator R. 

UjRUi = R. (7.102) 

Similarly, for the momentum operator P 

UjPUr = P. (7.103) 

Hence, for the angular momentum operator, L = R x P, one obtains 

UjLUr = L. (7.104) 


7.5.2 Time reversal 


The time reversal operator is expected to be different in nature from all other symmetry 
operators discussed upto now. This is due to the fact that the Schrodinger equation is first 
order in time and hence, a time reversal would change the sign of only the time derivative 
term. To be precise it can be seen that the time reversal operator, T, is antiunitary. We 
have seen this to be possible from theorem 7.3. However, we have also seen that continuous 
group transformations cannot be antiunitary. So, due to its discrete nature, it is possible for 
T to be antiunitary. To demonstrate the antiunitary nature of T, let us consider the energy 
eigenstate \E) at time t, = 0 that has the eigenvalue E. The result of a time translation of 
f followed by a time reversal must be the same as that of a time reversal followed by a time 
translation of —t. Hence, from equation 7.23 


TU t (t)\E) 
T exp(— iHt./h)\E) 
T exp(— iEt/h)\E) 
T exp(— iEt/h)\E) 


U t (—t)T\E), 

exp(iHt/h)T\E), 

exp(iEt/h)T\E), 

[< exp(-iEt/h)]*T\E) 


(7.105) 
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As the above equation is true for any t and E, an arbitrary state |s) that can be written 
as the linear combination 

\s) = J2^e\E), (7.106) 

E 

would be time reversed as 

T\ S )=Y / a* E T\E). (7.107) 

E 

Also the arbitrary state 

\r) = Y,b E \E), (7.108) 

E 

is time reversed as 

T\r) = Y / b* E T\E) (7.109) 

E 

Hence, 

(r\T^T\s) = £ b E ,a* E (E'\7*T\E). (7.110) 

E’E 

Due to time reversal symmetry the set of states {T\E)} for all E would be the same as the 
set of states {|T)} for all E. Hence, 

(E'\T^T\E) = 8 e , e (7.111) 

and then from equation 7.110 it follows that 

(r\T^T\s) = YbEa* E =(s\r). (7.112) 

E 

This demonstrates, from definition, that T is an antiunitary operator. 

Under a time reversal, one expects the position operator to stay unchanged and the 
momentum operator to change in sign. Thus 

TR = RT TP = -PT. (7.113) 

Hence, the angular momentum L = R x P has the property 

TL = -LT. (7.114) 


Problems 

1. Prove corollary 7.1. 

2. Show that the momentum eigenstates of a particle stay physically unchanged by a 
space translation. 
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3. Derive the equations 7.65 and 7.66. 

4. For a system of two angular momenta with given magnitudes of individual angular 
momenta and 1 2 fixed) show that the number of angular momentum eigenstates 
is ( 2 /i + 1 )( 2^2 + 1 ). 

5. Show the minimum value of l for the total angular momentum states is \l± — I 2 |. [Hint: 
For fixed values of li and I 2 , the number of eigenstates of both the individual angular 
momenta and total angular momentum are the same.] 

6 . Find the Clebsch-Gordan coefficients for l\ = 1 and I 2 = 1. 

7. A periodic potential F(r) has a three dimensional periodicity given by the vector 
a = (nidi, 7^202,71303 ) where ai, a 2 and 03 are fixed lengths and m, 712 and 713 can 
take any integer values such that 


V(r + a) = H(r). 

(a) Show that the discrete translation symmetry operators 

Ud( a) = exp(—iP • a/h) 
commute with the hamiltonian. 

(b) Show that the set {£/<j(a)} for all possible integers (tii, 77 , 2 , 713 ) in a form a group. 

(c) The Bloch states are defined by their position representation 

Ub = u( r) exp(ip • r/h), 

where u( r + a) = u{ r) and p gives three labels for such states. Show that these 
states are physically unchanged by Ud{ a). 



Chapter 8 


Three Dimensional Systems 


The generalization of problem solving methods to three dimensions is conceptually simple. 
However, the mathematical details can be quite nontrivial. In this chapter we shall discuss 
some analytical methods in three dimensions. Quite obviously, such methods can have 
only limited applicability. However, the analytical solution of the hydrogen atom problem 
provides a better understanding of more complex atoms and molecules. Numerical methods 
in three dimensions can either be based on the analytical hydrogen atom solution or be 
independent of it, according to the nature of the system. 


8.1 General characteristics of bound states 


The characteristics of bound states in one dimension can be generalized to three dimensions. 
If E < V at large distances in all directions, then the energy eigenvalues must be discrete. 
However, in three dimensions, there must be two other quantities, besides energy, that must 
also have discrete values. This is because in each dimension the condition of finiteness of 
the wavefunction will lead to some parameter being allowed only discrete values (using 
similar arguments as for energy in the one dimensional case discussed in chapter 5). For 
spherically symmetric potentials, it will be seen that the two extra discrete parameters are 
the eigenvalues of one angular momentum component (say L~) and the magnitude of the 
total angular momentum (L 2 ). A study of numerical methods (chapter 9) will clarify the 
nature of these discrete parameters for more general cases. 


96 
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8.2 Spherically symmetric potentials 


A spherically symmetric potential is defined to be invariant under any rotation about one 
fixed center of rotation. The center of rotation defines the origin of a convenient coordinate 
system. Spherical polar coordinates (r, 9, (j >) can be defined using this origin such that the 
corresponding rectangular coordinates are given by 

x = r sin 9 cos cj) 
y = r sin 9 sin 0 

z = ?’cos#. (8.1) 


In three dimensions the time independent Schrodinger equation has the following form. 

n 2 


2 m 


V u + Vu = Eu. 


( 8 . 2 ) 


For a spherically symmetric potential, V is a function of r alone. Hence, in spherical polar 
coordinates equation 8.2 would have the form: 


2 m 


1AI 

{ r 2 —\ 

r 2 dr 

V dr J 


d 


r 2 sin 


9 89 


, n d ^ 

sm6 W) } + 


8 2 


r 2 sin 2 9 8(j) 2 


■u + V ( r)u = Eu. (8.3) 


As V depends only on r, the following separation of variables for the function u is useful. 

u(r, 9 ,0) = R(r)Y (#, 0). (8.4) 


Inserting equation 8.4 in equation 8.3 and dividing by u gives 


1 cl 
R dr 


,clR 

dr 


1 

F 


+ 

8 


2 mr 2 


n 2 


[E-V(r)] = 


sm 


9 89 


sin 9 


8Y 

~89 


+ 


sin 


1 8 2 Y 


9 dcf) 2 


(8.5) 


As the left side of equation 8.5 depends only on r and the right side only on 9 and 0, it 
is evident that both sides of the equation must be equal to a constant (say K). Then we 
have the following two equations. 


1 d 


, dR 


r 2 dr 


1 8 

sin 9 89 


dr 


2m 


K 


- 21 ^ r— +- T [E-V(r)lR--,R = 0. 


h 


sml> a¥ ] + 


1 8 2 Y 

sin 2 9 84> 2 


+ KY = 0. 


( 8 . 6 ) 

(8.7) 


Equation 8.7 is independent of the potential. Hence, we shall solve it first. The variables 
of Y(9, (f>) can be further separated as follows. 


T(M) = ©(#)<!> W>). 


( 8 . 8 ) 
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This would lead to the following two separated equations. 


1 d 
sin 6 d9 



+ 




0 , 

0 . 


(8.9) 

( 8 . 10 ) 


The separation constant m 2 is like the K of the previous separation of the variable r. This 
m is not to be confused with the mass of the particle. The same symbol is used to maintain 
standard notation and the context of usage is seen to remove ambiguity. Equation 8.9 can 
be readily solved to give 

$ = Aexp(irruj)) + L?exp(— im<j)) for m^O, 

<f> = A + B(j) for m = 0 . ( 8 - 11 ) 

To maintain the continuity of the function, it is necessary to require that the value of the 
function be the same at 0 = 0 and 0 = 2n. This gives the only possible solutions to be 

4> = exp (im(t>), (8.12) 

where m is an integer. A constant coefficient is omitted here as it can be included in an 
overall normalization constant for Y. 


The solution of equation 8.10 is more involved. The following change of variables 
makes it easier to handle. 

w = cos 0. (8.13) 

The resulting form of equation 8.10 is 



(8.14) 


where P(w) = 0(9). As 9 belongs to the interval [0,7r], w must belong to the interval 
[—1,+1]. A standard series solution of equation 8.14 shows that P(w) is finite in this 
interval only if 

K = 1(1 + 1), (8.15) 

where l is a non-negative integer. As the wavefunction is necessarily finite, equation 8.15 is 
a required condition for allowed solutions. The allowed solutions of equation 8.14 for m = 0 
are the well known Legendre polynomials Pi(w ), where /, the order of the polynomial, is 
given by equation 8.15. For non-zero m, the allowed solutions of equation 8.14 are the 
associated Legendre functions P[ n that are defined as follows. 

rl\m\ 

P r = (1 - W 2 ) l m l/ 2 -—j- rP l (w) . 

1 v ; dw M v ; 


(8.16) 
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The PJ n can be seen to be non-zero only if 

\m\ < l. (8.17) 

Now, the possible solutions of equation 8.7 can be written as 


Yi m (9, 0) = Ni m Pl n (cos 9) exp (im<j>), 


(8.18) 


where 7V) m are the normalization constants. If the Ni m are chosen to be 


Nim = 


(21 + 1)(Z — \m\)\ 

47 t {1 + |m|)! ’ 


(8.19) 


then the Yj m are seen to be mutually orthonormal in the following sense. 

r 2n rn 

/ / Yi m Y Vm , smOdOdcj) = 8 w 8 mm '. (8.20) 

Jo Jo 

The functions Y) m are called the spherical harmonics. Some of the lower order spherical 
harmonics are as follows. 


> 0,0 

Y lfi 

Y h± i 
>2,0 
>2,±1 
> 2 , ±2 


1/VHt 



^^sm 2 6» exp (±2i(/>). 


( 8 . 21 ) 


8.3 Angular momentum 


In chapter 7, we found that a spherically symmetric system must have the three components 
of angular momentum as conserved quantities. Also, the operators L z and L 2 commute 
and hence have simultaneous eigenstates. In the position representation, the Y] m can be 
seen to be these eigenstates. By definition, 


L = R x P 


( 8 . 22 ) 
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and hence, in the position representation 


r , d 8 

L x = —in y— - z—), 


8z 

8 


dy 

d 


Lqj iTi ( z „ x - i , 

ox oz) 

T ■*! 9 9 

L~ = -in | x- - y- 


8y dx 

A transformation to spherical polar coordinates gives 


0 d 

L x = ih I sin d>— + cot 9 coscb— 
O0 o(p 


d 

L v = ih I — coscb— + cot0sinc() 

O0 


d_ 

' d(j)J ’ 


u = -mA. 


This also leads to the following expression for L 2 . 

L 2 = L 2 x +L 2 y + L 2 z 

. 2 f 1 d ( . 8\ 1 0 2 

[ sin 6 86 \ Sm ° 89 J + sin 2 6 8c,b 2 

From equations 8.7 and 8.15, it can now be seen that 

L 2 Y bn = 1(1 + l)h 2 Yi m , 

and from equation 8.18 one obtains 




(8.23) 


(8.24) 


(8.25) 


(8.26) 


(8.27) 


Hence, the eigenvalues of L z and L 2 are in accordance with the general results of chapter 7. 
Half-integer values of l are not allowed here. Later, it will be seen that half-integer values 
of l are possible only when the position eigenstates are degenerate (see chapter 12). 


8.4 The two body problem 


In realistic situations a spherically symmetric potential does not appear as a background 
potential for a single particle system. What is commonly encountered is a two (point) 
particle system with the force acting along the line joining the two particles. Such a system 
can be shown to be mathematically equivalent to a combination of two independent systems 
- a free particle system and a spherically symmetric one particle system. 
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The hamiltonian for such a two particle system would be 


p 2 

H t = Fl 


p 2 

+ ^ + V(r), 


2mi 2 m 2 


(8.28) 


where Pi and P 2 are the momenta of the two particles (with magnitudes Pi and P 2 ), ri 
and Y 2 are their positions and r is the magnitude of 


r = iq - r 2 . 

In the position representation, equation 8.28 gives 


H, = - 


n 2 


2mi 


-V? 


n 2 


2rn 2 


V| + V(r), 


(8.29) 


(8.30) 


where V 2 and V 2 have the meaning of the Laplacians for the position vectors iq and r 2 . 
The center of mass coordinates are defined by 


I V = 


miri + m 2 r 2 
mi + m 2 


(8.31) 


It can then be shown that the hamiltonian in terms of the center of mass coordinates r c 
and the relative coordinates r can be written as 


H< = 


h 2 h 2 

- ——V? — ——V 2 + V(v'), 
2 M c 2m K h 


where V 2 is the Laplacian in r c and V 2 is the Laplacian in r. Also, 

77117712 


M = 771 1 + 7712 1 m = 


Now H t can be written as 
where 


h 2 


mi + 7712 

H t = H c + H, 

n 2 


H c = --—V 2 , H = -—V 2 + V(r). 
2M c ’ 2m v ' 


(8.32) 

(8.33) 

(8.34) 

(8.35) 


As H c depends only on r c and H depends only on r, one can find their eigenvalues E c and 
E, independently. Then the eigenvalues E t of Ht would be 


E t — E c + E. 


(8.36) 


H c is the three dimensional free particle hamiltonian. Hence, its eigenvalues are continuous 
(from a generalization of the one dimensional case). The eigenvalues E of H can be found 
by solving the differential eigenvalue problem 


Hu = Eu. 


( 8 . 37 ) 
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This can be seen to be equivalent to a one particle spherically symmetric problem in the 
relative coordinate r if the equivalent mass is taken to be m (the so-called reduced mass). 

If H has a discrete spectrum, it can be “washed out” by the continuous spectrum of 
H c if the energies E c are large. This is observed in the case of the hydrogen atom. The 
energies E c are due to the random motion of the centers of mass of the atoms. Hence, at 
high temperatures E c is higher. So, to observe the discrete spectrum (of E) of the bound 
states formed by the electron and the proton of the atom, one needs to make observations 
at sufficiently low temperatures. Higher the temperature, broader will be the spectral lines 
observed. 


8.5 The hydrogen atom (bound states) 


The hydrogen atom is a two particle system (one electron and one proton) as discussed 
above. Its reduced mass can be seen to be almost equal to the electron mass as the proton 
mass is much larger. The electrostatic potential energy is 

V(r) = (8.38) 

r 

where k e = l/(4neo), eo is the permittivity of free space and e is the proton charge. Using 
this potential one can find the bound state energies E from equation 8.37. The form 
of the potential shows that bound states must have E < 0. With this condition one 
can solve equation 8.37 using the general method of section 8.2. The angular part of 
the eigenfunctions is already known. The radial part in the present case would be (from 
equations 8.6, 8.15, 8.33 and 8.38) 


1 cl I 2 dR 
r 2 dr \ dr 


+ 


2 mk e e 2 
h 2 r 


R + 


2 mE 


R- 


1(1 + 1 ) 

r 2 


R = 0. 


(8.39) 


This equation can be solved in a manner very similar to the one dimensional harmonic 
oscillator problem. To simplify the form of the equation, the dimensionless parameter s is 
defined as 


s = ar, a 2 = —8mE/h 2 


(8.40) 


With the definition 

. 2mkf,e 2 k e e 2 I—m 

l3= ^r = —\h* 

equation 8.39 becomes 


(8.41) 


1 d / 2 dR\ 'j3 1 
s 2 ds\ ds ) ^ _ s 4 


Z(* + l)l 

c 2 


R = 0. 


(8.42) 


The large s behavior of R can be seen to be of the form exp(—s/2). Hence, we choose 


R(s) = F(s) exp(—s/2). 


(8.43) 



CHAPTER 8. THREE DIMENSIONAL SYSTEMS 


103 


From equations 8.42 and 8.43 one obtains the differential equation for F to be 

d 2 F /2 \ dF 

ds 2 + \.s / ds + 

The following series form for F is chosen. 

OO 

F = s p J2 a i s ^ °o^O. (8.45) 

i =0 


P-1 1(1 + 1) 


F = 0. 


(8.44) 


On substituting this in equation 8.44 and equating the total coefficient of the lowest power 
of s to zero one obtains p = l or p = —(l + 1). As p = —(l + 1) would make the eigenfunction 
at the origin infinite, we must choose p = l. In a manner similar to the harmonic oscillator 
problem, the substitution of the series solution in equation 8.44 results in a recursion relation 
for the di s. 


(i + l + 1 — /3)cij 
a * +1 = (i + l)(i + 2Z + 2) ’ 


(8.46) 


Once again it can be shown that unless the series terminates, the function F rises fast 
enough at infinity to make R go to infinity at infinity. Hence, for an eigenstate the series 
must terminate. This will happen if for some non-negative integer n' 


(3 = n! + l + 1 = n. 


(8.47) 


Hence, from equation 8.41 the corresponding condition on the energy is seen to be (a 
subscript n is added to E to identify the energy level) 


and n > l. 


E„ = — 


mk 2 e 4 
2 h 2 n 2 


n = 1, 2, 3,... 


(8.48) 


The differences of these energy eigenvalues are observed as the energies of photons 
emitted by excited atoms. The resulting discrete line spectrum produced by hydrogen is of 
great historical importance. The part of the spectrum caused by transitions to the n = 2 
level, from higher levels, falls in the visible region and its distinct pattern was noticed early 
on (the Balrner series). Its dramatic explanation was a triumph of quantum mechanics. 
The extension of these quantum mechanical computations to multi-electron atoms turns 
out to be too complex for an exact treatment. However, phenomenologically motivated 
approximations are very useful in the study of atomic spectra[6]. Such spectra have been 
used for a long time to identify elements in a mixture - in particular, when the mixture is 
somewhat inaccessible for chemical analysis (as in stars!). The study of finer structure in 
atomic spectra has led to the understanding of other quantum phenomena like electron spin 
(which produces the so-called “fine structure”), proton spin (which produces the “hyperfine 
structure”), and also quantum field interactions (which produce the “Lamb shift”). 
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The functions, R, depend on both n and l and hence they are labelled as R n [. They 
can be found from the recursion relation of equation 8.46 and substitution in equations 8.43 
and 8.45. Some of the R, ri j are as follows. 


Rio(r) 

= 2a~ 3 ^ 2 exp(— r/a), 


R2o(r) 

= (2a) _3 / 2 (2 — r/a) exp(—r/(2a)), 


R 21 (r) 

_ 3~ 1 / 2 (2a)~ 3//2 (r/a) exp(—r/(2o)). 

(8.49) 


where a = h 2 / (mk e e 2 ). 

One notices that the energy eigenvalues given by equation 8.48 depend only on the 
quantum number n and not on the other two quantum numbers m and l. The eigenfunctions 
Unim (= Rnl^lm ); on the other hand, are different for different values of n, l, and m. This 
means that there must be some degeneracy in eigenstates. For a given l the number of m 
values is (21 + 1). Also, for a given n, l takes values from 0 up to n — 1. This gives the total 
number of states for a given n to be 


n— 1 

d=^2( 2l + l) = n 2 . (8.50) 

1=0 

Hence, the energy eigenvalue E n (sometimes called the n-th energy level) is n 2 -fold degen¬ 
erate. The inclusion of spin (chapter 12) makes this degeneracy 2n 2 -fold. 


8.6 Scattering in three dimensions 


In chapter 5, we defined scattering states as the ones for which E > V at infinity in 
some direction. The same definition can be used in three dimensional problems as well. 
However, for the sake of simplicity, in this chapter, we shall consider only the scattering 
states that have E > V in all directions at infinity. In chapter 5 it was also shown that, 
for scattering states, the experimentally significant quantity is the scattering cross section 
given by equation 5.6 or equation 5.21. 

As discussed in section 8.4, realistic potentials are usually not static background po¬ 
tentials. For two-particle systems, the potential is due to the interaction of one particle 
with another. Such systems can be reduced (in the fashion of section 8.4) to two indepen¬ 
dent systems - one a free particle and the other a particle in some effective background 
potential. The free particle part of the energy is seen to be zero in the center of mass 
frame. Hence, in a scattering problem it is easier to compute the cross section in the center 
of mass (CM) frame. This makes it necessary to determine a conversion factor between the 
CM frame cross section and the laboratory frame cross section. 
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Figure 8.1: Two particle scattering in (a) laboratory frame (b) CM frame. 
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8.6.1 Center of mass frame vs. laboratory frame 


Fig. 8.1a shows a standard laboratory frame setup for a scattering experiment. The target 
particle of mass m 2 is initially at rest. The incident particle of mass mi is moving along the 
positive 2 direction (horizontal) at a velocity v (magnitude v) initially. After the collision, 
the incident particle moves in a direction given by the spherical polar angles (0o,0o)- Its 
velocity is Vi (magnitude 17 ). The center of mass moves at the velocity v 7 (magnitude v'). 
It can be shown that 


. miv 
v = -. 

mi + m 2 


(8.51) 


Fig. 8.1b shows the same experiment in the CM frame. In this frame, before the collision, 
the target particle moves at a velocity of —v 7 and the incident particle at m 2 v 7 /mi. After 
the collision, the incident particle moves at the velocity v" (magnitude v") which has a 
direction given by the spherical polar angles ( 0 , 0 ).This is the scattering direction in the 
CM frame. For an elastic collision, in the CM frame, it can be shown that the magnitude 
of the velocity of each particle must be the same before and after collision. Hence, 


,, m 2 u 7 m 2 v 
mi mi + m 2 

From the definition of the CM frame we note that 


(8.52) 


Vi = v 7 + v 77 . 


(8.53) 


Writing equation 8.53 in component form gives us the relation between the laboratory frame 
angles (0 q, 0 O ) and the CM frame angles (0, 0). 


V\ cos 0o = v 1 + v" cos 0, 
v± sin 0 o = v" sin 0 , 


Hence, 

where 

For elastic collisions it can be seen that 


O- 

0 

II 

-e- 

(8.54) 

„ sin 0 

tan = . a , 

7 + cos 0 

(8.55) 

7 = v'/v". 

(8.56) 

that 


7 = mi/m 2 . 

(8.57) 


If the collision is inelastic, it is not necessary that the number of particles be the same before 
and after collision. Hence, the above analysis, would not work in general. At this point we 
shall not digress into the analysis of general inelastic collisions. However, the case of the 
inelastic collision with two product particles of masses m3 and 771,4 (mi + m 2 = m3 + 7714), 



CHAPTER 8. THREE DIMENSIONAL SYSTEMS 


107 


is not too different from the elastic case. In such a situation one needs to use the following 
expression for 7 in equation 8.55. 


7 = 


mim^E 


m 2 m A (E + Q)'' 


(8.58) 


where Q is the amount of internal energy converted into additional kinetic energy for the 
product particles. Q is negative for endothermic reactions. 


It is clear that the number of particles scattered in the same element of solid angle 
should not appear different in different frames of reference. Hence, we conclude 


a 0 ( 6 0 , 4>o) sin 9od9 0 d(j)o = a(6, <j>) sin OdOdcj). (8.59) 

Now using equation 8.55 and the last of the set of equations 8.54, it can be shown that 


,a x\ (1 +7 2 + 2q COS 0) 3 / 2 
= - 1 + 7 cos 


(8.60) 


8.6.2 Relation between asymptotic wavefunction and cross section 

Now that we know the relation between the laboratory frame and the CM frame measure¬ 
ments of cross section, all computations can be done in the CM frame. Equation 8.37 would 
be the relevant equation i.e. 

- V 2 u + Vu = Eu, (8.61) 

2m 

where m = mim 2 /(mi + m 2 ) and E (the energy in the CM frame) is related to Eq (the 
energy in the laboratory frame) as follows. 

E = rn 2 E 0 

mi + m 2 

In most scattering experiments, the region of space where a measuring instrument can be 
placed is at large values of r where V is effectively zero. Hence, one is interested in the 
solution of equation 8.61 in such “asymptotic” regions where r is very large. If the incident 
beam is a plane wave of fixed momentum moving in the positive 2 direction, then the 
expected form of the asymptotic solution is 

u(r, 8, (/)) = A[exp(ikz) + r~ l f{8, (f> ) exp (ikr)\, 

where 

z = rcos8, k = p/U = V2mE/h. (8.64) 

The first term in equation 8.63 is the incident beam which is a momentum eigenstate 
(exp(ip • r/h)) with p, the momentum, directed along the 2 direction. The second term is 


(8.63) 


(8.62) 
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the scattered wave in the lowest order of r -1 . Higher order terms of r -1 donot contribute, 
as the particle current due to them tends to zero at large distances within a fixed solid 
angle. Only inverse powers of r are expected in the scattered wave as it must die out at 
large distances. 

Normally, the incident beam is collimated to have a width large enough to maintain the 
plane wave nature near the target particle, but not large enough to produce any significant 
readings on the detectors that are placed at an angle away from the direct beam. Hence, 
the detectors observe only the second term in equation 8.63 and the observed dn/dw of 
equation 5.21 is seen to be (using equations 5.16 and 5.20) 

E = -M|V(M)| 2 . (8.65) 

du m 

As the incident beam is only the first term in equation 8.63, the N of equation 5.21 is found 
to be (using equations 5.16 and 5.20) 


N = hk\A\ 2 /m. (8.66) 

Hence, from equation 5.21, the scattering cross section evaluates to 

a = \f(9,<l>)\ 2 - (8-67) 

Thus, f(6, (f> ) is the quantity to be computed. 


8.7 Scattering due to a spherically symmetric potential 


For a spherically symmetric potential the solution for u(r, 0, 0) is expected to have a cylin¬ 
drical symmetry about the z axis and hence it must be independent of 0. Thus, from the 
analysis of section 8.2, one may write 

OO 

u(r,6) = ^2(21 + l)i l Ri(r)Pi(cos9), (8.68) 

1=0 


where the constant coefficient of each term is chosen for later convenience and Ri(r) is a 
general solution of equation 8.6 which can also be written as 


]_d_ ( odRA [ 2 _ 2 mV{r) 
r 2 dr V dr ) % 2 


*(* + 1)1 

r 2 


Ri = o. 


(8.69) 


where k 2 = 2 mE/h 2 . Due to the scattering nature of the wavefunction, one knows that E 
and hence k 2 are positive and also V tends to zero at large r. One can usually consider V 
to be effectively zero if r is greater than some constant a. In this interaction free region 
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the solution for R t can be found to be a linear combination of the spherical Bessel and the 
spherical Neumann functions: 

Ri(r) = Ai [cos 8iji(kr) - sin <5^ (At)]. (8.70) 

The spherical Bessel functions ji can be written in terms of the Bessel functions J; as 
follows. 

ji{kr) = (2kr/Tr)- 1/2 J l+1/2 (kr). (8.71) 

The spherical Neumann functions ri/ can be similarly written as 

m(kr) = (-l) l+1 (2kr/Tr)~ 1/2 J_i_ 1/2 (kr). (8.72) 

The asymptotic forms of ji and ni are given by [7] 

ji(kr) —> (kr)- 1 cos [At — (l + l)vr/2], (8.73) 

ni(kr) —> (kr)- 1 sin[Ar — (l + l)7r/2]. (8.74) 

Thus the asymptotic form of equation 8.70 is 

Ri(r) —> (kr)- 1 Ai sin(kr — Itt/2 + 6i). (8.75) 

To compare the actual solution to the form chosen in equation 8.63, one needs the following 
expansion of the incident wave in terms of the spherical Bessel functions. 

ex.p(ikz) = exp (ikr cos 9) 

OO 

= ^T / (2l + l)i l ji(kr)Pi(cos9). (8.76) 

1=0 

As the asymptotic form of equation 8.68 must be the same as equation 8.63, one can write 
(using equations 8.73, 8.75 and 8.76) 

OO 

^(2A + l)^(Ar) -1 sin(Ar — hr/2)Pi(cos 9) + r~ l f(9) exp (ikr) 

1=0 

OO 

= ^(2 1 + 1)^A;(At) -1 sin(Ar — hr/2 + 6i)Pi(cos9). (8.77) 

1=0 

Comparing the coefficients of exp(iAr) and exp(— ikr) one then obtains 

OO 

2 ikf(9) + ^2(21 + l)i l exp(— iln/2)Pi (cos 9) 

1=0 

OO 

= Yj2l + l)i l Ai exp(i6i — ihr/2)Pi (cos 9), (8.78) 

1=0 
OO 

^(2Z + 1 )i l exp (ilir/2)Pi (cos 9) 

1=0 

OO 

= ^(2Z + 1 )i l Ai exp (—i6i + ihr/ 2 )Pi (cos 9). 

1=0 


(8.79) 
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As equation 8.79 is true for all 6, it can be true only if 

Ai = exp (i6i). 

Substituting equation 8.80 in equation 8.78 gives 

OO 

f(0) = (2 iky 1 Y( 2 1 + l)[exp(2i<5 z ) - l]P z (cos0). 
1=0 

Hence, the cross section is 

<9) = \fm 2 

oo 2 

= k~ 2 Yj2l + 1) exp (i8i) sin 61 Pi (cos 9) . 

1=0 

The total cross section, which is defined as 


a t= c(0, 4>)duj = 


2n /*7r 


JO JO 


a(8, (f >) sin 8dQdq i, 


(8.80) 


(8.81) 


(8.82) 


(8.83) 

(8.84) 


would then be 

a t = 2n I a(6) sin Odd = 4irk~ 2 Y(2.1 + 1) sin 2 <5 Z . 

Jo 1=0 
The computation of the cross section in equations 8.82 and 8.84 would require the knowledge 
of the phase shift angles <5 Z . These can be found from the continuity of Ri(r) and its 
derivative at r = a. We have already defined a as the distance beyond which the potential 
V(r) is effectively zero. If the solution for r < a is found by some analytical or numerical 
method and then the corresponding value of (1 /R{)(dRi/dr) at r = a is determined to be 
7 1 , then equation 8.70 and the continuity condition would give 


kUifta) cos 6i — n[(ka ) sin <5 Z ] 
ji(ka ) cos 6i — ni(ka) sin 6i 


= Ih 


(8.85) 


where j[ and n[ denote the derivatives (with respect to their arguments) of the spherical 
Bessel and Neumann functions respectively. Solving for 6i gives 


tan 61 


kj'i(ka) ~ 7 dijka) 
kn'Xka ) — 7 Z n z (A:a) 


( 8 . 86 ) 


In summary, it should be noted that the computation of the scattering cross section by this 
method can become rather tedious if the sum of the series in equation 8.82 converges slowly. 
This and the fact that the solution of Rj for r < a cannot always be found analytically, tells 
us that such a method is usually suitable for a computing machine. For specific potentials, 
it is sometimes possible to use other methods to directly obtain an analytical solution for 
the cross section. An example is the scattering due to a repulsive 1 /r type of potential [8]. 
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Problems 


1. Derive equation 8.32 from equation 8.30. 

2. From equation 8.42, show that, for large s, R has the form exp(— s/2). 

3. Show that if the series expansion for F (equation 8.45) does not terminate, the solution 
is unacceptable. 

4. Obtain the functions in equation 8.49 from the recursion relation of equation 8.46 
and the condition of equation 8.47. 

5. Derive equation 8.58. 

6 . Find the phase shifts <So and 6± for scattering from a potential given as follows. 

V = Vo for r < a 

V = 0 for r > a 


where Vo is a constant. Also determine the conditions under which these phase shifts 
become infinite. Explain such conditions physically. 


Ms) 

n 0 (s) 

hi*) 

ni(s) 


s -1 sin s, 
cos s, 

s~ 2 sin s — s ' 1 cos s, 
—s~ 2 cos s — sin s. 



Chapter 9 


Numerical Techniques in Three 
Space Dimensions 


For the numerical solution of three dimensional problems, one can very often draw from 
the methods developed for one dimensional problems. However, the increase in the number 
of space dimensions can just as often require qualitatively new numerical techniques. For 
example, the three dimensional scattering problem is complicated by angular momentum, 
which is not defined in one space dimension. In the following, for each of the two cases of 
bound states and scattering states, we shall first discuss the simpler spherically symmetric 
problem and then the general problem. 


9.1 Bound states (spherically symmetric potentials) 


For spherically symmetric bound state problems the angular part of the solution is already 
known analytically (chapter 8). The radial part of the wavefunction depends on the func¬ 
tional form of V(r), the potential. For arbitrary V(r), a numerical solution of equation 8.6 
can be obtained in a manner similar to one dimensional problems. The method will be 
illustrated here using the hydrogen atom potential, so that results can be compared to 
analytical results. 


For the hydrogen atom, equation 8.6 takes the form of equation 8.42 with the appro¬ 
priate transformation to the dimensionless independent variable s. Equation 8.42 can be 
rewritten as 


d 2 R 2 dR r /3 1 

ds 2 + s ds + s 4 


1(1 + 1 )] 
c 2 


R = 0. 


(9.1) 


The initial conditions for this equation will be given at s = 0, and the solution need be 
found only for s > 0. Hence, the method of solution will be similar to the one dimensional 
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symmetric potential case. The potential still has reflection symmetries along each axis. 
But, as s is proportional to the magnitude of the displacement from the origin, it does not 
change sign in a reflection. Hence, the solution will not be exclusively odd or even in s. 
Nonetheless, one of the two independent solutions can be discarded as follows. 

It is possible to choose the form of R to be 


R(s) = s p G(s), G( 0) + 0, 

where G'(0) is finite. If equation 9.2 is substituted in equation 9.1, one obtains 


G" + 2(p + 1 ) G ,; + 


(3 1 p(p + l ) -1(1 + 1) 

a ' •> 


G = 0, 


(9.2) 


(9.3) 


where G' = clG/ds and G" = d 2 G/ds 2 . For s 0, the above equation can be true only if 
the coefficient of each negative power of s tends to zero. For the s~ 2 coefficient, this gives 


As G'(0) ^ 0, this gives 


\p(p + l)-l(l + l)}G(0) = 0. 


p = l or — (l + 1). 


Similarly, for the s 1 coefficient in equation 9.3 one obtains 

2(p+l)G'(0) + /3G(0) = 0. 


(9.4) 

(9.5) 

(9.6) 


The second choice in equation 9.5 would make R go to infinity at the origin. This is not 
allowed and hence 

p = l. (9.7) 

Thus, one of the two independent solutions of R is discarded. Using equation 9.7 in equa¬ 
tion 9.6 one obtains 

G( 0 ) = (9.8) 


2(Z + 1) 


Using equation 9.7 in equation 9.3 gives 

G" + 2{l + 1) G' + 


s 4 


G = 0. 


(9.9) 


Using equations 6.2 and 6.8, the finite difference form of equation 9.9 can be seen to be 

G, + j — G,_ i 


Gi —i — 2 Gi + Gj+i 2 (l + 1) 


w* 


2 w 


+ (fl/ s i ~ 1/4 )Gi — 0, 


(9.10) 


where Gi and .s* are the values of G and s at the i-th point and w is the width of the 
interval. This can be rewritten as the recursion relation 


Gi+\ — 


[2 + (1/4 — (3/Si)w 2 ]Gi + [(l + 1 )w/si — l]Gj_i 
1 + (l + 1 )w/si 


(9.11) 
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As the normalization constant can be chosen arbitrarily without changing the eigenstates 
and the eigenvalues, one can choose 


G'q — G'(0) — 1, 


(9.12) 


for convenience. A finite difference form of equation 9.8 would be 


Using equation 9.12, this gives 


C\ — Gq 
w 


G, = 1 


SGq 

2(1 + 1)' 

(9.13) 

(3w 

2(1 + 1)' 

(9.14) 


With equations 9.12 and 9.14 as initial conditions, one can solve the recursion relation of 
equation 9.11 for all G, L . In a fashion similar to the one outlined in chapter 6, one can 
solve for G for a series of values of E, the energy (or equivalently /?), to locate the energy 
eigenvalues. The change in sign of the tail of the wavefunction identifies the location of the 
eigenvalues. 


9.2 Bound states (general potential) 


Physically interesting potentials can, in general, be seen to have some symmetry properties. 
This is due to two reasons. First, a system with symmetry attracts more attention. Second, 
systems that are built by physicist and engineers are built with symmetry in mind so that 
theoretical computations are easier. An example is the system of electrons in a uniform 
magnetic field (see chapter 4). Hence, in solving an arbitrary problem, the first step is 
to identify as many symmetries as possible. Next, a coordinate system that matches the 
symmetry needs to be chosen (like the spherical polar coordinates were chosen for the spher¬ 
ically symmetric potential). This allows the separation of the time independent Schrodinger 
equation to a certain degree. The resulting simplified equations can then be solved either 
analytically (if possible) or numerically. It is not practical to present numerical methods 
for all possible symmetry situations. Hence, in this section a method will be outlined for 
the problem with no identifiable symmetry. Although such a complete lack of symmetry 
is highly unlikely, this method can give useful hints for the construction of algorithms for 
systems with some arbitrary symmetry (see problem 5). 

For the general numerical algorithm, we need to write the complete Schrodinger equa¬ 
tion in its difference equation form. Using equation 6.8 in each rectangular coordinate 
direction, the Laplacian of u will have the following difference form. 

V|u(i,i, k) = w~ 2 [u(i + 1 ,j, k ) + u(i - 1 , j, k ) + u(i,j + 1 , k) 

+ u(i,j - 1 , k) + u(i, j,k + l)+ u(i , j, k — 1 ) — 6 u(i, j, k )}, 


(9.15) 
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where the discrete indices i, j, and k for the three directions are written in parenthesis to 
avoid long subindices. These indices are assumed to take all integer values with an arbitrary 
origin defined by all zero indices. This will require some manipulations if the algorithm is 
implemented in the C language as C does not allow negative array indices. The difference 
form of the time independent Schrodinger equation would then be 

2777/ 

Vd u (i,j,k) + — T [E- V(i,j,k)]u(i,j,k) = 0 . (9.16) 

n 

Due to the three different indices, this will not reduce to any solvable recursion rela¬ 
tion. However, it can be visualized as a set of linear algebraic equations for the unknown 
u(i,j, ky s. As an illustration of the method of solution [5], consider a cubic region of space 
in which the solution is to be found. Let i, j, and k each run from 0 to a maximum of i m , 
j m and k m respectively in this region. To find bound states, the boundaries of the region 
must be chosen at a large enough distance where u goes to zero. Hence, in the present 
case u(i,j,k) must be zero if at least one of the three indices is 0 or the corresponding 
maximum value - i m . j m or k m . The values of u(i,j,k ) at the interior (non-boundary) 
points are the unknowns that need to be determined. They are (i m — 1) (j m — 1) ( k rn — 1) 
in number. Equation 9.16 provides the same number of equations if written for all interior 
points. Thus, a solvable system is obtained. In order to write these equations in a matrix 
form, the interior points must be suitably numbered such that the u(i,j,ky s appear as a 
one dimensional array. One possibility is illustrated by the following definition of the array 
U given for i m = j m = k m = 3. 

U\ = u(l, 1,1), U 2 = u( 1,1,2), U 3 = u( 1,2,1), U 4 = u( 1,2,2), 

U 5 = u( 2,1,1), U 6 = u( 2,1,2), U 7 = u( 2,2,1), U 8 = u( 2,2,2). (9.17) 

Of course, practical choices for i m . j m and k m must be much larger to achieve reasonable 
accuracy. 

The boundary values being zero, the linear equations for U turn out to be hornoge- 
noeus. Hence, for a non-zero solution, the related matrix must have a zero determinant. As 
this matrix contains the energy eigenvalue E, the zero determinant condition will provide 
the possible energy eigenvalues and the related eigenfunctions. This is a matrix eigenvalue 

problem for which standard numerical methods are available [5]. In general, for a large 

enough number of interior points, this will require significant amounts of computer time 
and memory. However, some efficiency is achieved by making use of the sparse nature of 
the matrix. It is seen to be tridiagonal with fringes. Besides, one is usually interested in 
the lower eigenvalues and some time can be saved by focussing only on them. It is to be 
noted that this method can provide only the lowest ( i m — 1 )(j m — 1 ){k m — 1 ) eigenvalues 
and higher the eigenvalue the greater is its inaccuracy. This is due to the rapid oscillations 
of the higher energy eigenfunctions which require smaller values of w to maintain accuracy. 

Another somewhat related method [9] is in some ways more intuitive. Instead of 
considering only the homogeneous boundary at infinity, one may include an arbitrary point 
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(normalization point) in the interior as part of the boundary. The value of the function 
at this point can be chosen to be an arbitrary non-zero constant using the freedom of the 
normalization constant. Now equation 9.16 can be written for all interior points other than 
the normalization point. This would result in an inhomogeneous matrix equation which can 
be solved for a given E. The search for the eigenvalues E requires solving the equation for 
a series of finely spaced values of E and then comparing them, in each case, to the value 
E n of E evaluated from equation 9.16 written for the normalization point. Every E that is 
the same as the corresponding E n , is an eigenvalue. 

It is to be noted that the methods discussed here provide only one quantum number 
- E. For the spherically symmetric case, n was related to E. However, there were two 
other quantum numbers (l and m) that were a direct result of the spherical symmetry. 
The general methods discussed here cannot identify these quantum numbers because, in 
general, there are no physically meaningful quantities that relate to these numbers. For 
comparison to symmetric cases, one may consider the following quantities, evaluated at 
some arbitrarily chosen position, to be the quantum numbers. 

, 2 1 d 2 u 2 1 d 2 U , 2 1 d 2 U 

K = -7TT, K = ~Tn- k z = T7TT' (9-18) 

u ox 2 y u ay 2 u oz 2 

where x, y and z are the usual rectangular coordinates. For a free particle system these 
quantum numbers are conserved quantities. The Schrodinger equation relates these quan¬ 
tum numbers to the energy eigenvalue E. Hence, one still has a total of three quantum 
numbers for a three dimensional problem. 


9.3 Scattering states (spherically symmetric potentials) 


It is to be noted that the computation method for scattering states described in section 8.7 
is more suitable for numerical computation rather than analytical computation. The infinite 
sum in equations 8.82 and 8.84 will, in general, require several terms in the series to be 
computed to get significant accuracy. Also the computation of the 7 1 of equation 8.86 would 
require the solution of a differential equation that is seldom expected to have analytical 
solutions for arbitrary potentials. Hence, here we shall discuss the numerical aspects of the 
same method. 


As an example, we shall consider the scattering of an electron due to a neutral atom of 
atomic number Z. The scattering potential of such a system can very often be approximated 
by 


V(r) = 


Ze 2 exp(— r/B) 
47 T 60 r 


(9.19) 


where e is the magnitude of the electron charge, eo the permittivity of free space, and B 
a measure of the range of the potential which is the “radius” of the atom. The first step 
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would be to solve equation 8.69 using this potential. The differential equation to be solved 
would be 

d 2 Ri 2 clRi 

+ — At 


where 


els 2 s els 
s = kr, k 2 = 


1 + 71 

2 2 mE 


n 2 


, 71 = 


As in section 9.1, we can write 


The equation for G would then be 


./6)_ICI + 1) -| 
s 2 

(9.20) 

2 mZe 2 

(9.21) 

9 , b - kB. 

4ireoh 2 k 

G( 0) + 0. 

(9.22) 


G" + 2 ( p + 1 ^ G' + 


1 + 1 exp(—s/b) + p(p + l) -/(/ + !) 


G = 0. 


Using arguments similar to section 9.1, one finds 

P = l, 

G\ 0 ) = 


TlG'(O) 


2(1 + 1 ) 


Inserting equation 9.24 in equation 9.23 gives 


G" + W±H G ' + 


1 + 71 


exp(— s/b) 


G = 0. 


(9.23) 

(9.24) 

(9.25) 

(9.26) 


From equations 6.2 and 6.8 one obtains the finite difference form of equation 9.26 to be 


Gi-i — 2 Gi + Gi -)_i 2(1 + 1) 

o ' 




G 


i +1 — Gi -1 


2 w 


+ 


1 + 71 


exp (-Si/b) 


Si 


Gi = 0. (9.27) 


where Gi and Si are the values of G and s at the i-th point and w is the width of the 
interval. This leads to the recursion relation 


G {+1 — 


[2 - (1 + Aexp(-Si/b)/si)w 2 ]Gi + [(l + 1 )w/si - l]Gj_i 


1 + (/ + 1 )w / Si 

Using the freedom of choice of the normalization constant, one chooses 

G'o = G(0) = 1. 

The finite difference form of equation 9.25 is 

G\ — G'o AG 0 


w 


2(1 + 1 )' 


(9.28) 


(9.29) 


(9.30) 
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From equations 9.29 and 9.30 one finds 


G 


i 


wA 

2(1 + 1 )' 


(9.31) 


The initial conditions in equations 9.29 and 9.31 allow us to compute G- L at all points using 
equation 9.28. One needs to know this solution up to a point (r = a or s = ka ) beyond 
which the potential is effectively zero. The choice of a depends on the desired accuracy. 
If this boundary point is reached at the n-th point in the numerical computation then 
s n = ka. Now, the 7 1 of equation 8.86 is defined as 


1 dR, 

k dR[ 

l kG' 

R[ dr 

r=a ds 

s=ka ® G 


Hence, the difference form of 7 1 would be 


(9.32) 


l k ( G n -\ \ 

7z = —I I 1 — -+r~ 

a w \ G n J 


(9.33) 


To find the phase shifts from equation 8.86 one needs equation 9.33, the spherical Bessel 
and Neumann functions and their derivatives. Standard series solutions for the Bessel and 
Neumann functions can be used for their numerical computation. Once the 6[ are known, 
equations 8.82 and 8.84 can be used to find the differential and the total cross sections. 


9.4 Scattering states (general potential) 


For arbitrary potentials, scattering cross sections are much more difficult to compute even 
numerically. Some approximate methods (see chapter 11 ) can handle such general potentials 
as long as they are small compared to the energy of the incident particle. The reach of such 
methods, which were originally designed for analytical computations, can be significantly 
increased by the use of a computing machine. 

On the other hand, a non-spherically symmetric scattering potential is very unlikely 
in realistic situations. A scattering experiment with such a potential would require both 
the target and scattered particles to approach each other at a fixed orientation for every 
collision. Such an experimental setup is difficult to come by. So, even for non-spherically 
symmetric potentials, the actual scattering data will show a spherically symmetric character 
as the random orientations of the particles in each collision will create the appearance 
of an average potential over all angles which is spherically symmetric. In some realistic 
experiments one may produce spin aligned target and scattered particles. In such a situation 
the spherical symmetry is truly lost. A proper treatment of these systems requires quantum 
field theory which is beyond the scope of this book. 

So, even though it is possible to come up with a numerical algorithm for scattering 
computations for non-spherically symmetric potentials, we shall not do it here. 
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Problems 


1. A spherically symmetric linear potential is given by 

V (r) = Ar 

where A is a positive real number constant. For this potential do the following. 

(a) Find the recursion relation for the numerical solution of the Schrodinger equation 
and also the necessary initial conditions. 

(b) Develop a computer algorithm to determine the eigenvalues and eigenfunctions 
of such a system. 


2. A spherically symmetric quadratic potential (3-dimensional isotropic harmonic oscil¬ 
lator) is given by 

V (r) = Ar 2 

where A is a positive real number constant. For this potential, repeat the steps 
as in problem 1 and then compare the numerical results with analytical results (an 
extension of the one dimensional harmonic oscillator problem to 3 dimensions). 

3. Phenomenological models of mesons often have the potential between the quark and 
the antiquark components to be a combination of a linear potential (as in problem 1) 
and a Coulomb type of potential as follows. 

V(r) = Ar — B/r 


where A > 0 and B > 0. For this potential, repeat the steps as in problem 1. 

4. Some other phenomenological models of mesons replace the linear part of the potential 
in problem 3 by a quadratic potential (as in problem 2). Hence, the potential is given 
by 

V (r) = Ar 2 — B/r 

For this potential, repeat the steps as in problem 1. 


5. A superlattice is fabricated by depositing several thin layers of two semiconductor 
materials alternated in a periodic fashion. If the layers are thin and uniform enough, 
quantum effects allow such structures to have electronic and optical properties that 
are not found in the constituent materials. An electron trapped by a positive ion 
impurity in a superlattice experiences the following potential. 


V(r) 


hrer 


+ W(z) 


where e is the electron charge, e the permittivity of the material and r = \Jx 2 + y 2 + z 2 . 
x, y and 2 are the usual rectangular coordinates with the z-axis oriented perpendicular 
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to the layers of the superlattice. The function W(z) can sometimes be approximated 
as 

w(‘) = {T° ! or !1 > “ 

| 0 for \z\ < a 

where Wo and a are positive constants. For this cylindrically symmetric potential, 
develop a computer algorithm to compute the eigenvalues and eigenfunctions of en¬ 
ergy [9]. 

6 . Develop a computer algorithm for the computation of the scattering cross section for 
the following potential. 

V(r) = 7Lexp(—or 2 ) 

where A and a are positive real number constants. 

7. For the scattering of neutrons off an atomic nucleus the effective potential can be 
approximated by the Woods-Saxon potential which is given as follows. 


V(r) 


-Vo 

1 + exp[(r — R)/a\ 


where Vq, R and a are positive real number constants. Develop a computer algorithm 
to compute the scattering cross section for this potential. 



Chapter 10 


Approximation Methods (Bound 
States) 


Given a specific problem in quantum mechanics, one needs to find the quickest method 
of solving it. An analytical solution is usually the most desirable. However, as we have 
seen in the previous chapters, such solutions are not always possible. In such a situation, 
numerical methods can often be used successfully. But we have seen in chapter 9 that 
numerical methods can sometimes be very time consuming. Hence, one needs to look for 
alternative methods that would be quicker. Sometimes approximation methods are very 
handy. Some readers might find this last statement somewhat perplexing as numerical 
methods are usually considered to be approximation methods. But here we shall use the 
terms “numerical” and “approximate” with different meanings. A numerical method can 
produce results of indefinitely high accuracy provided enough computer time is spent. An 
approximate method has its accuracy (and sometimes even its validity) limited by the 
characteristics of the specific problem being solved. For example, some series expansions 
converge only under certain conditions. 

Here, we shall discuss two of the most popular approximation methods - the pertur¬ 
bation method and the variational method 1 . 

1 The so-called WKB method [8] will not be discussed even though, historically, it has been very popular. 
This is because, at present times, any problem that can be solved by the WKB method, can be solved with 
far greater accuracy and speed by numerical methods. Besides, the original mathematical justification for 
the WKB method was rather weak and later mathematical treatments that better justify it are too lengthy. 
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10.1 Perturbation method (nondegenerate states) 

Let Ht, the hamiltonian of a system, be written as 

H t = H + H', (10.1) 

such that the eigenstates and eigenvalues of H are already known and written as | n) and 
E n respectively. So, 

H\n) = E n \n), (10.2) 

where n is the label that identifies a specific eigenstate. The corresponding eigenstates and 
eigenvalues of H t will be called \tn) and E tn respectively such that 

H t \tn) = E tn \tn). (10.3) 

The additional part, H\ is defined to be small if the differences (E tn — E n ) are suitably 
small compared to E n . The present method is valid only if H' is small and hence it will be 
assumed to be so in the following. The correction terms necessary to obtain the solution 
of equation 10.3 from that of equation 10.2 can be written as a series of increasing order of 
smallness. To keep track of the order, one uses a parameter A that is later set to be equal 
to 1. In terms of A one can write 



= H + XH', 

OO 

(10.4) 

1 tn) 

= ^2 X s \ns), 

s =0 

OO 

(10.5) 

Etn 

= 5> s £ ns , 

s=0 

(10.6) 


where s is the order of the correction terms |ns) and E ns for \tn) and E tn respectively. 
Substituting this in equation 10.3 and equating terms of the same order on each side, one 
obtains 


(H-E nO )\n0) = 0, (10.7) 

(H-E n0 )\nl) = (E nl -H')\n0), (10.8) 

(H-E n0 )\n2) = (E nl -H')\nl) + E n2 \n0), (10.9) 

(H-E n0 )\n3) = (E nl - H')\n2) + E n2 \nl) + E n3 \n0), (10.10) 


and so on. The first of the above set of equations shows that |n0) is an eigenstate of H. 
As in the present case we are discussing only nondegenerate states, one can unambiguously 
identify 

|n0) = | n), E n o = E, 


(10.11) 
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Using equations 10.2 and 10.11, one can see that multiplying each of the equations 10.7 
through 10.10 from the left by (n| makes their left hand sides vanish. This leaves one with 
the following equations. 

E nl = (n\H'\n), (10.12) 

E n 2 = (n\H'\nl) - E nl (n\nl), (10.13) 

E n3 = (n\H'\n2) - E nl (n\n2) - E n2 (n\nl), (10.14) 

and so on. Here it is assumed that the eigenstates | n) are normalized. As the eigenstates of 
H form a complete set, the correction terms |ns) in each order s can be written as a linear 
combination of the |n)’s as follows. 

\ns) = ^2a n is\i), for s = 1,2,3,.... (10.15) 

i 

Substituting these into the equations 10.7 through 10.10 and using equations 10.12 through 
10.14, it is possible to find all the coefficients a n * s except the a nns . This is because all terms 
containing a nns vanish identically. This leads to the conclusion that the a nns can be chosen 
arbitrarily. The simplest choice, of course, would be zero. This choice can be written in 
two equivalent forms: 

anns = 0, (n\ns) = 0. (10.16) 

Thus the equations 10.12 through 10.14 would simplify to 

E ns = (n| H'\n, s — 1), for s = 1, 2, 3,.... (10.17) 

Hence, the first order correction to the energy eigenvalues would be 

E n i = (n\H'\n0) = (n\H'\n). (10.18) 

The first order correction to energy eigenstates is given in the form of equation 10.15 to be 

l nl ) = ^ a nil\i)- (10.19) 

i 

Substituting this in equation 10.8 and using equations 10.2 and 10.11, gives 

~ E n)a n n\i) = (E n i - H’)\n). (10.20) 

i 

Multiplying this from the left by (j \ and using the orthonormality of the |i) states, one 
obtains: 

/ 7 I /T 7 177 \ 

f or j*n. (10.21) 

It should be noted that the value for a nn \ cannot be determined from equation 10.20. 
However, it has already been chosen to be zero in equation 10.16. This completes the 
computation of first order correction terms. 
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The second order correction to energy eigenvalues is seen from equation 10.17 to be 


E n 2 = (n\H'\nl). 

Using equations 10.19 and 10.21 one then obtains: 

(n\H'\i)(i\H'\n) 


E n2 ^ y jp ip 

i^n K ~ 


As H' is hermitian this gives 


E n2 = E 


\(n\H'\i}\ 2 


E n — Ei 

i^n n 1 


( 10 . 22 ) 


(10.23) 


(10.24) 


From equation 10.15, the second order correction to the eigenstate is seen to be 

In2) = 5> ni2 |i). (10.25) 


Substituting this and equation 10.19 in equation 10.9 and then using equations 10.2 and 
10 .11, one obtains 


'y ( (Ej E ri ))(i n j2 |i) (E n i H ) ^ ) Q"nii i) S~E n 2\7i). 

i i 

Multiplying this from the left by (j (j Y n ) and using orthonormality gives 

(Inj 2 ( E j E n ) — a n jlE n i ^ ) Uni i {j \ H i ). 

i 


Using equation 10.21 this gives 


nj2 fr* (E n - E j )(E n - E^ 


(j\H'\n)(n\H'\n) 
(E n - EjY 


(10.26) 


(10.27) 


(10.28) 


Once again a nn 2 cannot be found from equation 10.27. It was assumed to be zero in 
equation 10.16. This completes the computation of second order correction terms. Higher 
order corrections can be computed in a similar fashion. One could summarize the results 
upto second order as follows: 


Etn 
I tn) 


TP 1 / \ tj !\ \ \( 7l \ H '\ i )\ 2 1 

E n + {n\H | n) + 2 _—-—-b • • • 


n, 


+ E 


i+n En Ei 

(i\H'\n) ( (n\H'\n ) 


i^ n lEn Ei 


En — Ei 


(i\H'\j)(j\H'\n) 

j-^ n (En ~ Ei)(E n — Ej) 


I <) + -.. 


(10.29) 


(10.30) 
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It is to be noted that for higher order terms the computations get significantly more 
involved and hence, writing a computer algorithm for the general .s-th order correction term 
might be useful. But we shall not do it here. 


As an application of the above formalism one may compute the first order correction to 
the solution of the harmonic oscillator problem (see chapter 4) when a quartic perturbation 
is added. Then 


TT P 2 kX 2 KX 4 

H t = -1-1-. 

2 m 2 6 


(10.31) 


where K must be small enough for the perturbation analysis to work. As the solution to 
the harmonic oscillator problem is already known, we identify 


_ P 2 kX 2 
~ 2 ^ + 


H' = 


KX 4 
6 


In terms of the raising and lowering operators one may write 


H' = A(a-a f ) 4 , A 


Kh 2 

24m 2 L0 2 


The expanded form of H' would then be 


(10.32) 


(10.33) 


H’ = A(a 4 
T clo ) 2 


— a^a 3 — aa^a 2 — a 2 a) a — a 3 a) + a) 2 a 2 + a)aa)a + a)ci 2 a) 
a + aa)aa) + a 2 c^ 2 — a) 3 a — a) 2 aa^ — a)aa) 2 — aci^ 3 + a^ 4 ). 


(10.34) 


Now from equations 4.80, 4.82, 10.18 and 10.34 one computes the first order correction to 
the ground state to be 

Eqi = 3A. (10.35) 


For the perturbation analysis to be valid one must have 3 A 
tion 10.21 one finds 


^021 — 


3\/2 A 
huo ’ 


\/3A 


<C hu/2. 


Then from equa- 
(10.36) 


and all other aoji vanish. Hence, from equation 10.19 we get the first order correction to 
the ground eigenstate to be 


| 01 > 


7^ (6|2) ^ |4)) - 


(10.37) 


These computations could also have been done in the position representation. However, it 
would require the computation of several integrals. 


It should be noted that, however small K might be, for some higher excited states the 
correction terms will become too large for a perturbative computation to be valid. This is 
because a quartic potential rises faster than a quadratic potential as the position coordinate 


increases. 
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10.2 Degenerate state perturbation analysis 

If the eigenstates of H , the unperturbed hamiltonian, have some degeneracy then the iden¬ 
tification of |n0) in equation 10.11 need not be unique. The general method of analysis in 
such a situation can be understood by considering the particular case of a doubly degener¬ 
ate energy level, say E n . that corresponds to the two states \n) and | m). Hence, any linear 
combination of | n) and | m) will also be an eigenstate corresponding to E n . As a result of 
the perturbation, these states may not stay degenerate. The two resulting nondegenerate 
eigenstates of Ht must tend to eigenstates of H corresponding to E n as A tends to zero. 
However, in general, these limiting eigenstates of H are linear combinations of | n) and | m) 
and are to be identified as |n0). Hence, one writes 

|n0) = c n |n) + c m \m), (10.38) 

where the set of constants {c n ,c m } will be different for the limits of the two different 
eigenstates of H t . Cn and c, n are yet to be determined. The choice of equation 10.16 must 
now be extended to give 

(n\ns) = 0, (m\ns) = 0, for s = 1, 2,3,... (10.39) 

Equation 10.8 can now be multiplied from the left by each of (m\ and (n| to get two 
equations. These equations are (using equations 10.2, 10.11 and 10.38) 

((m\H'\m) HE n i)c m +(rn\H'\n)cn = 0, (10.40) 

(n\H'\m)c m +((n\H'\n) - Eni)^ = 0. (10.41) 

This is a set of linear homogeneous equations for c m and c n . A nonzero solution of this set 
is possible only if the determinant of the corresponding matrix vanishes. This gives 

((m\H'\m) - E nl )((n\H'\n) - E nl ) - (m\H'\n)(n\H'\m) = 0. (10.42) 

The solution of this gives the two different correction terms for the two corresponding states 
of Hf. 

E n i = ^((m\H'\m) + (n\H'\n )) 

n -i 1/2 

± -((m\H'\m) — (n\H’\n)) 2 + \{m\H'\n)\ 2 . (10.43) 

The two correction terms are seen to be identical if and only if 

(m\H'\m) = (n\H'\n), and (m\H'\n) = 0. (10.44) 

If the two correction terms are different, the degeneracy is said to have been “lifted”. The 
values for c m and c n can now be found for each value of E n i by using equation 10.40 (or 
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equation 10.41) and a suitable normalization. This would give the two possible choices of 
|n0). The correction \nl) to each of these can be found by the same method as for the 
nondegenerate case using the two conditions in equation 10.39. Once the degeneracy is 
lifted, higher order corrections can be found as for the nondegenerate case. 

If the degeneracy is not lifted in first order computations, a similar method can be 
employed in second order computations. 

As an example of the method developed in this section, we shall study the effect of 
a relatively weak uniform magnetic field on the bound states of a spherically symmetric 
system (in particualar the hydrogen atom). This is known as the Zeeman effect. The vector 
potential for a uniform magnetic field B can be written as 

A = i(B x R), (10.45) 

where R is the position vector operator. The hamiltonian, in the presence of such a vector 
potential in addition to the scalar potential q i, would become [ 2 ] 

= < ia46) 

where m is the reduced mass, P the canonical momentum and e the magnitude of the elec¬ 
tron charge. As A is the perturbing term, one can separate Ht in the form of equation 10.1 
with 


H = < 10 - 47 > 

H’ = ^-(P- A + A-P + e.4 2 ). (10.48) 

2m 

For the present computations, the A 2 term can be ignored as the magnitude of B is assumed 
to be relatively small. Using equation 10.45 and the commutators of the components of P 
and R, one can show that P A = A P. Then we have 

H' = —A P = — B x R P = — B R x P = — B L, (10.49) 

m 2m 2m 2m 

where L is the angular momentum. Now, let us compute the first order corrections to the 

first excited states (n = 2) of the hydrogen atom. The 4 degenerate states can be written 
in the form | l,m) for the various possible angular momentum quantum numbers: 

10 , 0 ), | 1 ,- 1 ), 11 , 0 ), | 1 , 1 >. 

Without loss of generality, if the magnetic field is considered to be in the x-direction, then 

, eBL T 
H = 


2m 


(10.50) 
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Using equations 7.39 and 7.40, this would give 


H' = 


eB_ 
4 m 


(L+ + LJ). 


(10.51) 


Now, using equations 7.65, 7.66 and 10.51, the nonzero matrix elements of H' for the 4 
degenerate states with n = 2 are found to be 

(1,-I|i7'|l,0) = (l,0|i7'|l,l) = -^- 

2 y2 m 

The method described earlier for a 2-fold degenerate level can be easily generalized for 
the 4-fold degenerate level in this example. Then, using equation 10.52, the 4 first order 
correction terms can be found to be 


(10.52) 


0 , 0 , 


heB 
^~ 2 m ’ 


heB 
2 m 


Hence, we notice that two of the original degenerate states are still degenerate. If the 
magnetic field were chosen to be in the z-direction, the computations would have been 
somewhat simpler. The computation of the corrections to the eigenstates will be left as an 
exercise (problem 2). 


10.3 Time dependent perturbation analysis 


All hamiltonians considered in this text have, until now, been time independent. Hence, 
energy has been conserved and energy eigenvalues have been meaningful measured quanti¬ 
ties. However, the only direct way of measuring the energy of something like a hydrogen 
atom would be to measure its mass and use the relativistic mass energy equivalence. This 
being a particularly inaccurate method of measurement, one needs to find indirect methods 
of measuring the energy. The most common method would be to allow the system (for 
example, the hydrogen atom) to transfer from one energy level to another and release the 
difference in energy in the form of another particle (for example, a photon). If the released 
particle is a photon, its frequency v can be measured spectroscopically and then the energy 
computed to be 2-kUv. This is a result of the original Planck hypothesis. It will also become 
evident from our later discussion of the relativistic Schrodinger equation for the photon. 
Thus, by this method one can find the differences between energy levels. The energy levels 
can very often be reconstructed from these differences. 

However, to use the above method, it would become necessary to transfer the system 
from one energy level to another. For a conservative system this is, by definition, not 
possible. Hence, a time dependent (thus nonconservative) perturbation, H'(t ), must be 
added to the system to achieve the transfer between levels. This H'(t ) must be small in 
order not to disturb the energy levels of the original system too much. If H'(t) were too 
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large, energy would not be conserved even approximately and hence, energy measurements 
would be of no use. Very often H'(t ) is introduced through the sinusoidal variations of 
electromagnetic radiation. Hence, we shall choose 

H'(t) = Hq sw.(ut), (10.53) 


where Hq is independent of time. The total hamiltonian would be 


H t = H + H\t), 


(10.54) 


and the development of an arbitrary state |s) would be given by the Schrodinger equation 
to be 

ih^ = H t \s). (10.55) 

As the eigenstates |i), of the original hamiltonian H, form a complete set, it must be 
possible to expand |s) as follows. 

|s) = aj(t) exp(— iEjt/h)\i), (10.56) 


where is the eigenvalue of H corresponding to |i). The exponential time dependence 
is written separately from the o*(t) as it is expected that for small H'(t ), will vary 
slowly with time. This is because in the absence of H'(t), would be constant. Now, 
substituting equation 10.56 in equation 10.55 and then multiplying from the left by (j 
would give 

ihex.p(-iEjt/h)—U- = exp(— iEit/h)(j\H'\i). (10.57) 

i 

If we choose 

uji = (.Ej - Ei)/h, (10.58) 

then equation 10.57 could be written as 

(10.59) 

i 

Once again, we can expand the solution in a perturbation series using the parameter A to 
identify the order of smallness of a term. Hence, we write 

OO 

a* = ^2 A s ai S . (10.60) 

s=0 


As before, the H' in equation 10.59 must also be multiplied by a A to keep track of orders. 
Then, substituting equation 10.60 in equation 10.59 and equating terms of the same order 
would give 


dcijQ 

clt 

dtij( s _ir \) 


0 , 

c^r 1 1] a is exp(iWjit)(j\H'\i). 


(10.61) 


dt 


(10.62) 
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Equation 10.61 reasserts the fact that a 3 is independent of time in the absence of the 
perturbation H'. In a typical physical situation, the system is initially (t = 0) in an 
eigenstate of H. At t = 0 the perturbation H' is turned on and at a later time T it is 
turned off. If that initial eigenstate is |n), then for t < 0, a n = 1 and all other dj’s are zero. 
As the cijo do not change with time, this means 

a n o = 1, (10.63) 

ajo = 0 for j n. (10.64) 


Hence, from equation 10.62 the first order corrections for t > T are 

f T 

/ (j\H'(t)\n) exp(iujj n t)dt. 

J o 

Then from equation 10.53 we obtain 


Cljl — 


(j \H 0 \n) 
2 ih 


exp [i(uj n — oj)T] — 1 exp [i(uj n + iv)T] — 1 


iOjn M 


IVjn + u> 


Hence, dji can be seen to have maximum values at iVj n = ±u; that is 

Ej = — hco, or Ej = E n + hiv. 


(10.65) 


( 10 . 66 ) 


(10.67) 


The first case is interpretted as a high probability for the system to emit the Planck en¬ 
ergy Tuv and transfer to a suitable lower energy state. The second case shows a similar 
high probability for the system to absorb the energy hco and transfer to a suitable higher 
energy state. Of course, in each case, the corresponding lower or higher energy must be 
an eigenvalue of H. The first case is often known as stimulated emission where external 
electromagnetic radiation of frequency iv stimulates the system to radiate energy in the 
form of photons of the same frequency. Emission can also take place in the absence of 
external radiation (spontaneous emission). However, such a phenomenon can be explained 
only by a quantum field theory which is beyond the scope of this book. The second case is 
that of absorption of energy from the external radiation. Both the emission and absorption 
of specific frequencies of electromagnetic radiation can be observed spectroscopically. Thus 
it is often possible to reconstruct the energy eigenvalues of H from spectroscopic data. 

It is to be noted that the peaks of a 3 \. as a function of u>, become narrower and higher if 
T becomes larger. This is understood by considering the perturbation H' to be a sinusoidal 
wave within a rectangular envelope ranging from t = 0 to t = T. The Fourier transform of 
such a time function can be seen to have a larger range of frequencies for smaller T. This 
means that, for smaller T, the perturbing external radiation provides photons of a larger 
range of frequencies and hence, allows transitions of the system to a larger range of energy 
states. This prompts one to write what is known as the time-energy uncertainty relation: 


AEAt > h, 


(10.68) 
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where A E is a measure of the range of energies around Ej that it is possible to transfer to 
and At is a measure of the duration of time for which the perturbation is on. The symbol 
‘<7 means ‘approximately greater than or equal to’ In the present case At = T. This 
uncertainty relation can be written mathematically more precisely by defining A E and At 
more precisely. But we shall not do it here. 


Another item to be noted is that cij i also depends on the time independent factor 
(j | Hq | n). An extreme example would be that of (j\Ho\n) = 0. In such a situation the 
probability of transition from | n) to | j) will be zero even if ujj n = Eu). Such a condition 
gives the so called selection rules. 


In a process like the ionization of an atom, the final state \j) is not a bound state and 
hence Ej is not a discrete energy level. There is in fact a continuum of energy eigenvalues Ej. 
This makes it more practical to compute a transition probability (or ionization probability) 
per unit time for the whole range of energies Ej. Of course, in such a situation Ej > E n 
and hence, the part of cij± in equation 10.66 that describes the emission process is small 
enough to be ignored. This gives the transition probability to be 


M 2 = 


\{j\H 0 \n)\ 2 sin 2 [(ujj n - iv)T/2] 


h 2 (uj 


jn 


■UJ 


(10.69) 


The total transition probability to all states with energies around Ej = E n + tux (the peak 
of this probability function) can be obtained by integrating | a-yi | 2 over such states: 


P = J \aji\ 2 p(j)dEj, 


(10.70) 


where p(j) is defined as the density of states at the energy eigenvalue Ej. This total 
transition probability is expected to increase with the duration T of the perturbation. 
Hence, one defines a convenient measurable quantity called the transition probability per 
unit time: 

w = P/T = T- 1 J | ajl \ 2 p(j)dEj. (10.71) 

For a large enough T, |oji| 2 would peak so sharply that p(j) and (j\Ho\n) would be effec¬ 
tively independent of Ej in the range where |ayi | 2 is not negligible. Hence, in the integral 
of equation 10.71, one can pull the term p(j)\(j\Ho\n)\ 2 outside the integral. This gives 


7T 


W = ^f)dU)\(j\Po\ n )\ 2 ■ 


(10.72) 


The above integral was done by substituting x = (iOj n 
definite integral. 

r co 


-2 


sm 


2 xdx = 7T. 


— u>)T/2 and using the following 

(10.73) 


The infinite limits are justified by the fact that the integrand is sharply peaked for large T. 
Equation 10.72 is sometimes called Fermi’s golden rule. 
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10.4 The variational method 


A weakness of the perturbation method is that the total hamiltonian H t must be a close 
approximation of some H for which the eigenvalues and eigenstates are already known. 
Hence, we shall now discuss a method (called the variational method) that can be used 
to find eigenvalues and eigenstates for any bound state problem. However, this method 
also has a shortcoming. It requires the knowledge of the eigenstate as a function of some 
undetermined parameters prior to finding the solution. Guessing such a functional form is 
usually possible if one has some physical understanding of the problem. It will also be seen 
that the variational method is most appropriate for the ground state. The computation of 
higher excited states becomes progressively more involved. 

To understand the variational method we shall prove the following theorem. 


Theorem 10.1 For an arbitrary nonzero state |s), the expectation value of H, the hamil¬ 
tonian, satisfies the following inequality. 


(H) s = 


m\ S ) 

(s|s) 


> Do 


where Eq is the lowest energy eigenvalue (ground state energy). On physical grounds, it is 
assumed that Eq exists. 


Proof: Let the set of eigenstates of H be {|i)} (i = 0,1, 2,...) with the corresponding set 
of eigenvalues {£)}. As { |i) } must be a complete set, it is possible to expand |s) as 
the following series. 

\s) = J2“si\i)- (10.74) 

i 

Hence, using the orthonormality and eigenstate property of {|i)} 


(s\H\s) T,i\a S i\ 2 Ei T,i\a S i\ 2 (Ei - E 0 ) 

, I ~T — ^ - ry = -- h Eq > Eq 

wl'®/ Eii |®si| Eii |®si| 


(10.75) 


as Eq is the lowest energy and hence (£) — Eq) > 0 for all i. This completes the proof. 


The following corollary of the above theorem can also be proved with little difficulty (see 
problem 5). 


Corollary 10.1 If the state |s) is orthogonal to the lowest n eigenstates of H, that is 

a s i = (i\s) = 0 for i = 0,1 ,..., n, 


m\s) 

(s|s) 


> E n +1 - 


then 
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The variational method involves the choice of a trial ground state \a, 0) that is a function 
of some arbitrary parameters ccj (i = 1,2,...). The expectation value of H for this state 
is denoted by (H) a q. A minimum of (H ) a o with respect to all the parameters a* can be 
found. From theorem 10.1, we know that this minimum must be greater than or equal to 
Eq, and if the functional form of the trial ground state is chosen appropriately, it could be 
a good approximation for Eq. The a* determined in the process of minimization, would 
also give the corresponding approximation for the ground state |0). To find the first excited 
state, corollary 10.1 would be used. A trial first excited state is chosen to be orthogonal 
to the already determined approximate ground state. This trial state can be used to find 
the approximations of E\ and 11) by repeating the procedure used for the ground state. In 
principle, this method can be used for the computation of any number of higher excited 
state. However, in practice, it becomes progressively more difficult to choose trial states 
for higher energies that must be orthogonal to all lower eigenstates. 


To illustrate the method outlined above, we shall find the ground state |0) and the cor¬ 
responding energy E 0 for a particle of mass m placed in the following spherically symmetric 
potential. 


V(r) = -k 


exp(— r/a) 


(10.76) 


where k and a are positive constants. For this potential, it can be shown that the position 
representation uq, of the ground state |0), behaves as exp(—or) at large r. Hence, for the 
ground state we shall choose this functional form to be the trial eigenfunction. 


uq = exp(—or), 


(10.77) 


where a will be the only variational parameter used here for the minimizing of the expec¬ 
tation value of H. The expectation value of H for this wavefunction is: 


= f u^Hupdv 
° f UqUqcIv 


(10.78) 


where civ is the volume element and H is in its position representation. If H and dv 
are written in spherical polar coordinates, the above integrals are quite straightforward to 
compute. The result is: 


(H)o 


U 2 a 2 Aka 3 a 2 

2m (2 aa + l) 2 


(10.79) 


For the value of a that will minimize (H) o, one must have 


d(H) o 
da 


(10.80) 


d 2 (H) o 
da 2 


> 0 . 


and 


(10.81) 
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From equations 10.79 and 10.80, one finds that the value of a that will minimize (H) q must 
be a solution of the following cubic equation. 

16(aa) 2 — 12aa(2aa + 1) + b(2aa + l) 3 = 0 , (10.82) 


where 


b = 


n 2 


mka 


(10.83) 


Hence, finding a involves the solution of equation 10.82. This can be achieved numerically 
by standard methods. For the specific case of b = 20/27, the solution is possible without 
resorting to numerical techniques. It is seen to be given by 


aa = 1, 1/10, —5/4. 


(10.84) 


The negative root is not possible as both a and a are positive. Of the other two roots, the 
first one is seen to give a lower value of ( H ) 0 . It can also be seen that for aa = 1, the 
inequality 10.81 is satisfied. Hence, we conclude that the minimum in (H) o is obtained by 
aa = 1. This gives the estimate for the ground state energy to be 


F = 

0 27o’ 

and the corresponding normalized eigenfunction is 


(10.85) 


u 0 



exp (—r/o). 


( 10 . 86 ) 


One may test this method by taking the limit of a —» oo. In this limit the potential becomes 
the same as that for the hydrogen atom. The corresponding (H) o is seen to be 


(H)o 


h 2 a 2 

2m 


ka. 


(10.87) 


Minimizing this gives the value of a to be mk/Ti 2 . Hence, the estimated value for Eq is 


E 0 = (10.88) 

This and the corresponding eigenfunction u 0 can be seen to be the same as the exact results 
of chapter 8. 


Problems 

1. If the one dimensional harmonic oscillator is perturbed by a potential of the form 

H' = Hexp(— aX 2 ), 

where A and a are positive constants, find the first order correction to the ground 
state energy. (Hint: the position representation is convenient for this problem.) 
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2. Find the first order corrections to the energy eigenstates for the Zeeman effect case 
discussed in the text. 

3. If the hydrogen atom hamiltonian is perturbed by the following: 

H' = AL 2 , 

where A is a constant and L 2 is the magnitude squared of the angular momentum, 
find the first order perturbation corrections to the energies of the degenerate states 
corresponding to n = 3. 

4. For the time dependent perturbation given in equation 10.53, let 

Hq = ik • R 

where k is the wave vector of the perturbing electromagnetic wave and R the position 
vector. If the unperturbed system given by H is the hydrogen atom, find the selection 
rules for the differences in the angular momentum quantum numbers (l, m) between 
the initial and final states of a transition. 

5. Prove corollary 10.1. 

6 . For the perturbed harmonic oscillator of problem 1, find the ground state energy by 
the variational method. Compare the results to that of the perturbation computation. 
(Hint: One may consider the following as a trial wavefunction with a as the variational 
parameter: uq = exp(— ax 2 ).) 



Chapter 11 


Approximation Methods 
(Scattering States) 


In general, the quantum mechanical scattering problem is difficult to present in a well 
defined mathematical form. This, obviously, makes it more difficult to solve. The cause of 
this difficulty is in the inherent nature of the time independent Schrodinger equation. It is 
an equation that needs boundary conditions. These boundaries are usually at infinity. For 
a scattering problem one does not know these boundary conditions a priori. In fact, the 
solution of the problem amounts to finding the behavior of the wavefunction at infinity! 
Hence, in the special cases discussed in chapters 5 and 8, we have had to guess the form of the 
solution at infinity with some undetermined parameters. The solution of the problem had 
then amounted to the determination of these parameters. However, in a general situation, 
it is not always possible to guess the form of the solution at infinity. Hence, one sometimes 
needs to approach the problem differently. 

In the following we shall discuss such a different approach. The problem will be 
considered to be a limiting form of an initial value problem. The potential will be assumed 
to be “turned on” at some initial time —T/2 (T —> oo) and “turned off” at the later time 
T/2. This does not change the problem physically, but allows one to assume the initial 
state to be a free particle state with a fixed momentum (i.e. a momentum eigenstate) 
corresponding to the incident beam. The final state is also a free particle state, but it is 
expected to be a linear combination of different momentum eigenstates. The probability 
of the final state being in a momentum state in a certain direction will give the scattering 
cross section in that direction. Hence, we need a formalism that will allow us to find the 
state of the system at the time T/2, if it is known at the time —T/2. This is a standard 
initial value problem and can be solved by using the following Green’s function method. 


136 



CHAPTER 11. APPROXIMATION METHODS (SCATTERING STATES) 


137 


11.1 The Green’s function method 


Let the Schrodinger equation be written in the following form. 

= [H 0 + V]\s,t), (11.1) 

where |.s. t) is an arbitrary state for a system given by the hamiltonian 


H = H 0 + V, 


( 11 . 2 ) 


and 


(H.3) 

is the kinetic energy part of the hamiltonian. The state \s,t) is labelled by s and, for 
convenience in the present discussion, its dependence on time t is explicitly specified in 
the notation. We shall now state without proof that the Schrodinger equation specifies a 
well defined initial value problem i.e. if | s,t) is given at some time t, then the Schrodinger 
equation will give a unique | s,t'} at another time t'. Due to the linearity of the equation, 
it is expected that \s,t) and \s,t') are linearly related and hence the following convenient 
form of the relation is chosen. 


s, t') = iG(t t)|s, t), 


(11.4) 


where i = \/—1 and G(t',t) is an operator on the linear vector space V of quantum states 
as defined in chapter 1. The position representation of G(t',t), given by the following, is 
usually called the Green’s function. 


(r / |G(i / ,t)|r) = G(r',t';r,t), (11.5) 

where |r) and |r 7 ) are eigenkets of the position operator. To avoid superfluous nomenclature, 
we shall call G(t', t ) also the Green’s function. From problem 2 of chapter 4, we notice that 
G(t', t ) is related to the time translation operator: 

G(t',t) = <-iexp[—iH(t > — t)/h\. (11.6) 


For our present application, it is convenient to separate G(t', t) into two parts that propagate 
a state either forward or backward in time. Hence, we define the retarded Green’s function 
or the propagator as 


G + (t',t ) 


G(t', t ) for t' > t 
0 for t' < t 


(11.7) 


This gives 


9(t’ — i)|s, t') = iG + (t', t)|s, t), 


( 11 . 8 ) 


where Oft! — t) is the step function: 


Oft’ - t ) 


1 for t! > t, 
0 for t! < t. 


(11.9) 
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Similarly, one defines the advanced Green’s function as 




—G(t',t ) for t 1 < t 
0 for t 1 > t 


( 11 . 10 ) 


This gives 

9(t — i')|s, t’) = —iG~(t\ t)|s, t). 

It can be shown that (see problem 3) 

A.0(t' -t) = 8(t' -t). 

Using equation 11.12, one can differentiate equation 11.8 to give 

6(t' -t)\s,t') +0(t' = i-^-G + (t',t)\s,t). 


( 11 . 11 ) 


( 11 . 12 ) 


(11.13) 


Using equations 11.1, 11.8 and the assumption that V, the potential, does not have a time 
derivative operator term, one obtains 

9(t'= h~ 1 HG + (t',t)\s,t). (11.14) 

Also, from a property of the delta function (see equation 1.72) it is seen that 

8(t' — i)|s, t') = 6(t! — t)\s, t). (11.15) 


Hence, from equations 11.13, 11.14 and 11.15 we get 

G + (t',t)\s,t) = 8(t' - t)\s,t). (11.16) 

As | s,t) is an arbitrary state, one concludes that the following operator relation is true in 
general. 

i-^~±H\G+(t',t) = 8(t'~t). (11.17) 

It can be shown that G~(t',t ) also satisfies equation 11.17: 

'^7 - j^H G~(t',t) = 8(t' —t). (11.18) 

For the special case of the free particle, the Green’s functions will be called Gq" and G'q . 
They satisfy the following equations. 

i W~l Ho \ G ^ t ' ,t) = (11 - 19) 

= 6(1/-t). (11.20) 

Now we shall write G' ± (t / , t) (the ± superscript denotes both the retarded and the advanced 
functions) in terms of G,y (tJ. t ) and the potential. In order to do this we need some formal 
mathematical definitions. 
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Definition 38 A general TIME DOMAIN LINEAR OPERATION Q on a time dependent 
state |s, t) is defined by an operator Q(t, t') with the two time arguments t and t' as follows: 

I s,t) | r,t) 

such that 

\r,t) = Q\s,t) = I Q(t,t')\s,t')dt', 

where the integration overt' is from —oo to +oo. Note the compact notation chosen for the 
time domain operation by Q. Q, without the time arguments, denotes an integration over 
one of the time arguments as shown. This allows a natural matrix interpretation that will 
be discussed in the following. 

CH(t'. t ) represent such time domain operators. The product of two time domain operators 
P and Q is defined through their successive operations: 

| s,t) Q | r,t) | u,t), (11.21) 

such that 

| u,t) = P\r, t) = PQ\s, t) = I P(t,t')\r,t')dt' 

= j P(t,t') I Q(t',t")\s,t")dt" dt' 

= J j P(t,t')Q(t',t")dt' | s,t")dt". (11.22) 

Hence, the product of two time domain operators P and Q is found to be also a time 
domain operator that is defined by the following operator: 

[PQ\(t,t") = J P(t,t')Q(t\t")dt'. (11.23) 

It is to be noted that these operators bear a strong resemblence to matrix operations. The 
two time arguments are continuous equivalents of the two discrete indices of a matrix. This 
matrix nature of time domain operators is over and above their matrix nature in the linear 
vector space of kets (see chapter 1). In fact, if these operators and the corresponding kets 
are written in the position representation, both the space and time coordinates behave as 
continuous indices for matrices. This process of putting space and time on similar footing 
is useful for the discussion of relativistic quantum mechanics (see chapter 13). For now, we 
need to identify the identity operator for such time domain operations. It is clearly seen to 
be the delta function: 


I t = 6(t - t'). 


(11.24) 
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The operator in the square brackets in equation 11.17 is also a time domain operator. It 
can be represented in the standard form with two time arguments as follows: 

i———H —» i8'(t — t') — h~ 1 H8(t — t'), (11.25) 

where 8'(t — t') represents the derivative of the delta function as defined in chapter 1. The 
form of the operator shown in equation 11.25 can be shown to be correct by operating on 
an arbitrary time dependent ket | s,t) i.e. one can verify that 

i-^j — —H |-s. /.) - - — t')-’t')]\s,t')dt'. (11.26) 

If the time domain operator of equation 11.25 is called K, then equations 11.17 and 11.18 
can be written in the following compact form. 

KG ± = I t . (11.27) 

Hence, if A' -1 is defined to have the standard meaning of an inverse, then 

G ± = K~ 1 . (11.28) 

This gives the interesting result that the inverse of K is not unique! The free particle 
version of equation 11.28 would be 

GtmKo 1 , (11.29) 

where the differential operator version of Kq is 

M4-H- (lL30) 

Then, from equations 11.2 and 11.28, one may write 

G ± = {K 0 -V/h}- 1 . (11.31) 

Now, for some small enough V, we may formally expand the right side of the above equation 
as a binomial expression. Such an expansion for operators, is not always valid. We shall 
justify it for the present case after considering the result as follows. 

G ± = [K 0 (I t - K^V/h)}- 1 = [I t - K^V/h}- 1 !^ 1 
= [I t + K^V/h + (Kq 1 V/K)(Kq 1 V/K) + .. ,}K^ 

= G^ + h~ l G±VG± + h~ 2 G±VG±VG± + ... (11.32) 

Here it is seen that the nonuniqueness of the inverse of K on the left hand side shows up 
in the nonuniqueness of the inverse of Kq on the right hand side. Gq has been used for 
A^ 1 with the understanding that the equation is correct for all ‘+’ superscripts or all ’ 
superscripts. It can be seen that mixing ‘+’ and ’ superscripts will not give the correct 



CHAPTER 11. APPROXIMATION METHODS (SCATTERING STATES) 


141 


limit for V —> 0. If we consider V to be time dependent, its time domain operator would 
be represented as 

V ^V(t)8(t-t'). (11.33) 

Then it can be seen that each term in the infinite series of equation 11.32 has the meaning 
of a time domain operator. For example: 

[G±VG±VG±]( tl ,t 6 ) 

= J J IJ G o (ti,h)V(t 2 )8(t 2 ~t 3 )Go (t 3 ,U) x 

xl/(i 4 )<5(i 4 - t 5 )GQ(t 5 ,t 6 )dt 2 dt 3 dt 4 dt 5 

= J f G^(t 1 G2)V(t 2 )G^(t 2 M)V(U)G^(UMdt2dU. (11.34) 

With some changes in the time parameters the above result is rewritten as 

[G±rG±V'G±](t,0 

= // Gi(t,t 1 )V(t 1 )Gi(t 1 ,t 2 )V(t 2 )Gi(t 2 ,f)dt 1 it 2 . (11.35) 

Similarly, 

[G±VG±](t,t f ) 

= f Gq (t,h)V(ti)GQ (ti,t')dti. (11.36) 

For some small enough V, if the infinite series of equation 11.32 converges 1 , it can be 
shown to give solutions of equations 11.17 and 11.18. In doing this, we substitute the series 
solutions from equation 11.32 into the left hand sides of equations 11.17 and 11.18: 

KG ± = [Kq - V/n]G ± = K 0 G ± - n~ 1 VG ± 

= [It + h-'VGfi + h~ 2 VG±VG± + .. .] 

- [n~ l VG± + h~ 2 VG±VG± + ...] 

= It, (11-37) 

where equation 11.29 is used to see that K 0 G^ = I t . This is the justification for the binomial 
expansion procedure used in equation 11.32. It can be readily verified that equation 11.32 
could also be written as 

G ± = + h~ l G±VG ± . (11.38) 

Although we have derived the above equations for both the retarded and the advanced 
Green’s functions, only the retarded function G + will be needed for most applications here. 
In the study of quantum field theories, an interesting linear combination of G + and G~ 
is used to represent the behavior of both particles and antiparticles. This combination is 
called the Feynman propagator. 

1 Here convergence means the existence of a finite limiting value of the quantity (r|G ± |s) where (r| and 
|s) are arbitrary. 
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11.2 The scattering matrix 


Equipped with the Green’s function, we can now return to the scattering problem. As 
stated earlier we visualize the scattering process to start with a beam of free particles that 
have a sufficiently sharply defined momentum such that each particle may be considered 
to be in a momentum eigenstate corresponding to that momentum. As before, the density 
of the beam is taken to be low enough to assume each particle to be a system by itself that 
does not interact with the others. After this momentum eigenstate is in place, at some time 
—T/ 2, the scattering potential is turned on. Let the initial state at some time t (t < —T/2) 
be denoted by |p, t) where p has the corresponding momentum eigenvalue components. 
The potential is turned off at a later time T/2. At a time t! ( t' > T/2), the initial state 
would have transformed to some scattered state | s,t') given by the Green’s function to be 

\s,t') = iG + (t',t)\p,t). (11.39) 

The scattering cross section computation involves the probability of detecting a particle in 
some given direction. This could be found from the probabilty of the final particle being in 
a momentum eigenstate |p',T) where p' is in the given direction. The probability of such 
an event is given by postulate 4 of chapter 2 to be proportional to |(p / ,i / |s,i / }| 2 . Hence, 
one must compute the following quantity which is known as the scattering matrix. 

S(p',p) = (P = i(p',t'\G + (t',t)\p,t). (11.40) 

Using equation 11.38, one then gets 

S(p',p) = i(p',t'\G£(t',t)\p,t) 

+ ih~ x J (p , ,t / |G' ( /(t / ,ti)U(ti)G' + (ti,t)|p,t)dti. (11-41) 

As the state |p ,t) is a momentum eigenstate, it must be an energy eigenstate for the free 
particle (see chapter 4). Hence, its time dependence is given by equation 4.4 to be 

|p, t) = exp(-iEt/h)\p), (11.42) 

where E = p 2 /(2m) and |p) = |p, 0) is the momentum eigenstate at time t = 0. Thus the 

operation with the free particle propagator gives (from equation 11.8) 

Gq (t',t)\p,t) = -i\p,t'), (11.43) 

as it is known that t' > t. It can also be shown from the relation of G - and the adjoint of 
G'+ (see problem 2) that in the integral in equation 11.41 

(p' ,t'\G^(t' Ri) = -i(p',G|, (11.44) 

where it is noted that t' > t\ as the potential is defined to be zero for any time greater than 
T/2. From equations 11.41, 11.42, 11.43 and 11.44 one obtains 

S(p',p) = 6 3 (p' -p) + H- 1 j(p',t 1 \V(t 1 )G + (t 1 ,t)\p,t)dt 1 , 


(11.45) 
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where the three dimensional form of equation 3.24 is used to get 

(p'Ip) = <5 3 (p' - p) = 6(p' x - p x )6(p' y - p y )8(p' z - p z ), 

which is the product of three delta functions in the three momentum directions. As the 
potential goes to zero for t\ < t, one may replace the following in equation 11.45. 

G + (h,t)\p,t) = M|s,£i), (11.46) 

where |s, t\) denotes the state of the system at the time t\. Then we have 

S( p',p) = 5 3 (p' -p) - ih~ l J (p',ti|y(ti)|s,ti)dti. (11.47) 

In particular, if the scattering potential is time independent, then one may write 

V(t) = Vg(t), (11.48) 


where V depends on position alone and 


ff(t) 


1 for -T/2 <t<T /2 
0 otherwise 


(11.49) 


and T —> oo. For practical purposes an actual measurement of scattering is made in a direc¬ 
tion away from the incident beam direction 2 and hence the scattered particle momentum 
p' is different from p. Hence, scattering amplitude computations require only the following 
part of S(p', p). 

S s ( p',p) = -ihr 1 j (p' ,ti\V(ti)\s,ti)dti. (11.50) 


The corresponding probability is proportional to |5 s (p / ,p)| 2 . As the eigenstates |p 7 ) are 
continuous, the transition is expected to occur to a group of states in the infinitesimal 
neighborhood of p'. If the number of states in this neighborhood is dM, then the probability 
of transition is proportional to \S s (p',p)\ 2 dM. Similarly, the probability of scattering in 
any specific direction (including the direction of the incident beam) can be seen to be 
p)| 2 dM and hence, the total probability of scattering is / ^(p', p)| 2 dM. Then, the 
fractional probability of scattering, in the direction p 7 , relative to the total probabilty of 
scattering is 

/ g |g,( P C p)| 2 rfAJ 
f \S{p',p)\ 2 dM ’ 

2 Most of tlie incident beam goes through unscattered. So the off-axis scattering is small and requires 
high sensitivity particle detectors for its measurement. Such detectors would get overloaded (and probably 
destroyed) if the strong unscattered beam were to hit them. Besides, the unscattered beam has no interesting 
information anyway. 




CHAPTER 11. APPROXIMATION METHODS (SCATTERING STATES) 


144 


where f E denotes an integral over only those states |p 7 ) that are in the same scattering 
direction but have different energies E. As V is usually small, equation 11.47 allows us to 
approximate <S'(p / , p) by <5 3 (p 7 — p). Thus we have 


J E \S s (p',p)\hlM 
/|<S 3 (p 7 ,p)|W • 


(11.52) 


We now need to write dM in terms of a d 3 p' , the infinitesimal volume element in the p 7 
space. To do this we realize that for a discrete set of eigenstates |i), labelled by an integer 
i, the number of states AM between i = M and i = M + AM can be written as 


M+AM 

AM = ^ (i\I\i), (11.53) 

i=M 

where I is the identity operator. This can be generalized for the continuous states |p 7 ) to 
get 

dM = (p'\I\p')d 3 p'. (11.54) 

For the continuous eigenstates |p), the identity may be written as 


I = 


J 


p)(p| d 3 P- 


(11.55) 


Hence, 


dM = 


(p / |p)(p|p / )d 3 P 


d 3 p'. 


From normalization it is seen that (p 7 |p) = <5 3 (p 7 — p). This gives 


dM = 6 : v - P yy. 


(11.56) 


(11.57) 


The <5 3 (p 7 — p 7 ) in the above expression is infinite. However, we shall leave it as such 
expecting later cancellation 3 . Now the integral in the denominator of equation 11.52 can 
be evaluated to give the following. 

w = I E \S s (p\p)m 

6 3 (p - p) 

Physically, it can be seen that W depends on T, the duration for which the scattering 
potential is turned on. Hence, it would be convenient to define the transition rate per unit 
time as 

w = W/T. (11.59) 

3 If one feels uncomfortable carrying around such infinities, it is possible to use one of the limiting forms 
of the delta function (see for example equation 1.78) without taking the limit right away. The limit can be 
taken at the end of all computations at which point no infinities will remain! 


(11.58) 
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In equation 5.16 the probability current was defined. Using that definition we can see the 
probability current of the incident beam is 

S = (p/m)|(r|p)| 2 , (11.60) 

where |r) is the position eigenstate and (r|p) is the position representation of the momentum 
eigenstate. From this one may compute the fractional probability current, by dividing by 
the total probability of the incident beam which is 


f l( r lp)l 2 d 3 r = (p|p) = 6 3 ( p - p). 

Hence, the fractional probability current of the incident beam is 

g = Pl( r |p)| 2 

m<5 3 (p — p) 

A three dimensional generalization of equation 3.20 gives 

(r|p) = (2irh)~ 3 / 2 exp(ir • p /%). 


(11.61) 


(11.62) 


(11.63) 


Hence, 

s = _E_. 

(27rft) 3 m<5 3 (p — p) 

Using fractional probabilities in equation 5.21 would give the scattering 
follows. 


(11.64) 
cross section cr as 


adu> = w/ |s|. (11.65) 

Using equations 11.58, 11.59 and 11.64 one then obtains 


adu = m f |5 s (p , ,p)| 2 dV, (11.66) 

pT J E 

where p = |p|. As the final energy of the scattered particle is given by E' = p' 2 /(2m), it 
can be seen that in the spherical polar coordinates in p' space 

d 3 p' = p 2 cWd(j)dp' = mp dE' dOdcj) = mp'dE'dui. (11.67) 


Equations 11.66 and 11.67 would then lead to the following. 


(2irh ) 3 m 2 
pT 


J p'\Ss(p',p)\ 2 dE', 


( 11 . 68 ) 


where the subscript E on the integral is dropped as the range of integration is now clear 
from the differential dE'. 
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11.3 The stationary case 

The computation of a from equation 11.68 is quite formidable in general. However, a useful 
special case (stationary case) can be handled with relative ease. This is the situation where 
the potential is time independent as given by equation 11.48, and the energy of a state is 
conserved during its propagation in time i.e. the state |s,ti) defined in equation 11.46 can 
be written as 

|s, ti) &= exp(— iEti/h)\s), (11.69) 

where E = p 2 /(2m) is the energy of the incident particle and |,s) is time independent. Then, 
from equations 11.42, 11.48 and 11.50 one obtains 

S(p',p) = -ih^/p'lVls) j g(ti)eyLp[i(E' - E)ti/Ti\dti. (11.70) 

Inserting this in equation 11.68 gives 

I p '\(p'\V\s)\ 2 j g(ti)eyip[i(E' - E)ti/n\dti dE' . (11-71) 

The integral over t\ can be seen to tend to a delta function as T —> oo. This delta function 
peaks at E = E' . Hence, the term p'\ (p'lHls)) 2 can be replaced by its value at E = E' and 
then taken out of the dE' integral. This gives |p'| = p' = p and hence 

o = <y27T ^ m \(p'\V\s)\ 2 J J g(ti)exp[i(E' - E)ti/h]dti dE'. (11.72) 

The integral in equation 11.72 can be written as 

2 

j j g(t\) exp[i(E' — E)ti/h\dti dE' 

= / // exp[i(E' — E)(t — t')/h]dtdt'dE' 

= 2ttH J j g(t)g*(t')6(t — t’)dtdt' 

= 2nh J \g(t)\ 2 dt = 2i tUT. (11.73) 

In the last step above, the definition of g(t) from equation 11.49 is used. Now equation 11.72 
reduces to 

a = (27r) 4 li 2 ?n 2 |(p / |H|s)| 2 . (11.74) 

This happens to be a special case of Fermi’s golden rule. The state |s) can be computed to 
a desired order of accuracy using the Green’s function and then inserted in equation 11.74. 
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11.4 The Born approximation 


For the lowest order computation of <j, one writes equation 11.39 as 

| s,t') = iGq (t f ,t)\p,t). = |p, t') = exp(— iEt'/K)\p). (11.75) 

Hence, in the lowest order approximation (called the Born approxomation), |s) is the initial 
momentum eigenstate |p). Then equation 11.74 gives 

a = (27r) 4 ft 2 m 2 |(p / |'F|p)| 2 . (11.76) 

As the potential is usually given as a function of position the following computation is best 
done in the position representation. 


(p'|V|p) = J J (p / |r / )(r / |H|r)(r|p)d 3 ?’ / d 3 r 

= I J(p'\r')V(r)6 3 (r' — r)(r\p)d 3 r'd 3 r 
= j (p / |r)H(r)(r|p)d 3 r 

= (2tt?i)~ 3 I V(r) exp[ir • (p — p')/h]d 3 r, (11.77) 


where equation 11.63 is used as the position representation of the momentum eigenstate. 
If 


q = (p' - p)/h, 


(11.78) 


then equations 11.76 and 11.77 give 


2 

m 

An 2 l% A 


V (r) exp[—iq • r \cl 3 r 


(11.79) 


For a spherically symmetric potential, the angular part of the integration in equation 11.79 
is quite straightforward. The result is 


a = 


4 m? 

¥^ 


2 


rV(r) sin(qr)dr , 


(11.80) 


where r = |r| is the radial coordinate and q = |q|. For the stationary case, being discussed 
here, |p'| = |p| = p. The angle 0 between the directions of p' and p is then given by 


cos 0 = p’ ■ p/p 2 . 


(11.81) 


Also 


q = h 1 |p / — p| = (p/h)[2 — 2cos 0] 1 / 2 = (2 p/h) sin(0/2). 


(11.82) 
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As an example, we shall consider the case of electrons scattering from a neutral atom of 
atomic number Z. The potential for such a scatterer was given in chapter 9: 


V(r) = 


Ze 2 exp (—r/B) 
Ittcq r 


The resulting scattering cross section can be computed to be 


(11.83) 


m 2 Z 2 e A 

(2iTe 0 h 2 ) 2 (q 2 + B~ 2 ) 2 


(11.84) 


The angle dependence of a is due to q. The total cross section can be computed by 
integrating over all angles. As there is no 0 dependence, this leads to 


H . 2nh 2 f 2 P/ h 

a t = 27r / cr sin 6cW = —/ aqclq 
Jo P Jo 

m 2 Z 2 e 4 B 4: 
n(eoh) 2 (4p 2 B 2 + h 2 ) 


(11.85) 


Problems 

1. Using the definitions in equations 11.8 and 11.11 show the following. 

G + (t',t) = iG + (t',ti)G + (t\,t) if t' >t\> t. 

G~(t',t ) = —iG~(t',ti)G~(ti,t) if t' < t\ < t. 

2. Show that the adjoint of G~(t',t ) is G + (t,t') i.e. 

[G'“(t / ,t)] t = G + (t,t'). 

3. Prove equation 11.12 from the definition of the delta function given in chapter 1. 

4. Prove equation 11.18. 

5. Verify the relation in equation 11.26. 

6. Find the scattering cross section in the Born approximation for the following spheri¬ 
cally symmetric potential. 

V{r) = | 


where Vq is a constant. 


Vo for r < a 
0 for r > a 



Chapter 12 


Spin and Atomic Spectra 


A comparison of the classical and quantum postulates of chapter 2 brings out an interesting 
distinction. The classical descriptor, the trajectory, is postulated to be directly observable. 
But the quantum descriptor, the state vector, is observed only indirectly through the prob¬ 
abilities of measurements. Also, the state vector is defined only through its properties and 
not as any specific mathematical object. This allows the theoretical freedom of choosing 
different types of mathematical objects as state vectors. Infinite dimensional wavefunctions 
(as defined in chapter 3) as well as finite dimensional vectors can represent state vectors 
in different situations. Until now we have not discussed the observational consequences of 
having a finite dimensional vector as the state vector. In fact, it is not self evident that 
there exist any physical systems described by such state vectors. A study of atomic spectra 
shows that they do. 

The emission spectra of atoms show a “fine structure” of lines that cannot be explained 
by the usual representation of state vectors viz. single valued wavefunctions [10]. This is 
not a weakness of the principles of quantum mechanics but a result of a system dependent 
assumption made earlier (chapter 3). In fact, in chapter 3, we had made two significant 
system dependent assumptions. 

Assumption 1 All positions on a rectangular coordinate system are measurable (i.e. they 
are eigenvalues of the position operator). 


Assumption 2 Position eigenstates are nondegenerate. 


The first of these two assumptions is seen to be observationally correct for most systems 
that need a quantum description. A counterexample would be the postulated structure of 
the universe in relativistic cosmology. In such a model (Friedmann), space is curved like 
the three dimensional surface of a sphere embedded in four dimensions. If a rectangular 
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coordinate system were to be set up in such a space, the positive coordinate direction 
would loop around and return to the origin! Thus, no coordinate would represent observed 
positions beyond a certain distance. Similar effects of curved space may also be observed 
near massive stars. Since most quantum observations are made in a scale significantly 
smaller than the universe (putting it mildly!) and far enough from massive stars, the 
approximation of a rectangular coordinate system existing, is a very good one. Hence, we 
shall not discuss this matter any further in this text. 

A counterexample of the second assumption is the subject of discussion in this chapter. 
Once the position eigenstates are no longer assumed to be nondegenerate, the wavefunctions 
will not remain single valued. A systematic study of such systems can be seen to explain the 
“fine structure” in atomic spectra and other phenomena that have no classical analog. Such 
systems can also be seen to have observable angular momenta that are not due to spatial 
rotation [10]! The extra degrees of freedom from the degenerate position eigenstates are 
called the SPIN degrees of freedom due to the angular momentum they generate, although 
there is no accompanying spatial rotation. Spin angular momentum can have the half 
integer quantum numbers that were seen to be possible, in general, in chapter 7. 


12.1 Degenerate position eigenstates 


Due to existing experimental evidence, we shall assume the degree of degeneracy d, of 
position eigenstates, to be finite. It is also assumed that d is a constant for a given particle 
at all positions. For example, the electron is known to have d = 2. For a given position 
eigenvalue r, the possible (orthonormal) position eigenstates will now be labelled as \i, r), 
i = 1,2,, d. Then the position representation of any state |s) is defined as 

(i,r|s) = Vbi(r), for i = 1,... ,d. (12.1) 

The different functions ip s i(r) f° r each value of i are called the spinor components. The 
operation of a rotation operator Ur, on |s) has the position representation (i,r|I7^|s) that 
may be written as 

d 

(i,r\U R \s) = ^{U s ) i j^sj{cC 1 Y), (12.2) 

i =i 

where a is the rotation matrix defined in chapter 7. The Us matrix operator is yet to be 
defined. It is to be kept in mind that Ur, a, and Us depend on three parameters given by 
the direction n and magnitude 6 of the rotation i.e. the functional forms of these operators 
can be written as Ur( n, 6), a{ n, 6) and Us{ n, 9). Us is introduced to include the possibility 
that a rotation might transform a single spinor component to a linear combination of spinor 
components. It can be seen to be unitary as Ur is unitary: 

d 

E (UsYjiiUs^k = Sik or U ] S U S = I. 

3=1 


(12.3) 
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We notice that to maintain the physical effect of a rotation, the Us operators for different 
physical rotations must mimic the algebra of the corresponding Ur operators. That is if 


/7 R (n 1 ,0 1 )t/ R (n 2 ,0 2 ) = [/ R (n 3 , 03 ) 


(12.4) 


UsinuOjUsfa, 9 2 ) = Us( n 3 , 0 3 ). (12.5) 

Hence, the set of Us operators for all possible rotations must be a finite dimensional unitary 
representation of the rotation group 50(3). In chapter 7 we have already seen such finite 
dimensional representations, but have not identified them as such. 

It can be seen that the generators of rotation (angular momentum components), on 
operating on the angular momentum eigenstates \l,m), produce a linear combination of 
angular momentum eigenstates that have the same value of l. Hence, a subspace 


V/ — \ ^ ^ C rn E C 


( 12 . 6 ) 


of linear combinations of angular momentum states with fixed l is closed under the operation 
of the angular momentum operators. For example, the operation of L x on the state | l,m) 
can be written as 


L x \l,TTl) ) ' {Clx)mm' l^j ^ )> 


(12.7) 


where Li x is a finite dimensional matrix of dimensionality 21 + 1. This matrix must be a 
representation of L x within the subspace V;. Li y and L[ z would similarly represent L y and 
L~. Exponentiating these angular momentum representations as in equation 7.34, gives 
Usii the representation of the rotation group in the subspace V;. 


U R i(n , 9) = exp(—iL; • n 9/h), 

which can be seen to be a representation of 50(3) from the fact that if 

Ur( ni, di)U R (h 2 , 0 2 ) = U R ( n 3 ,0 3 ), 


( 12 . 8 ) 


(12.9) 


U R i{ni,9i)U R i{n2,0 2 ) = U R i(nz, 0 3 ). ( 12 . 10 ) 

This finite dimensional representation is said to be “carried” by the subspace V;. Two such 
representations, U R i and U R i / placed in a block diagonal form in a matrix of dimensionality 
2(1 + V) + 2, can also be seen to be a representation of 50(3). In fact, any matrix of the 
following block diagonal form, with two or more blocks, can be seen to be a representation 
of 50(3). 

/ U m (n,9) \ 

U m in,9) 

U RB (n,9)= U Rl „(h,9) • ( 12 - n ) 
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where l, l', and l" are the corresponding total angular momentum quantum numbers. Urb 
is a reducible representation of 50(3). A general definition of reducible and irreducible 
representations is as follows. 


Definition 39 Any representation Ua, of a group, that can be reduced to a block diagonal 
form Ub of more than one block (for all elements), by some unitary transformation given by 
a constant unitary operator U as follows, is called a REDUCIBLE REPRESENTATION. 

U B = UU A U t. 


Definition 40 Any representation that is not a reducible representation is called an IR¬ 
REDUCIBLE REPRESENTATION (IRR for short). 

We shall now state two theorems without proof [11, 12]. 

Theorem 12.1 The finite dimensional representations Uri for each l are IRRs. 

Theorem 12.2 Every unitary IRR of the rotation group can be transformed, by a unitary 
transformation, to a Uri for some l. 


A constant unitary transformation of operators as in definition 39 accompanied by the 
transformation 

\s)^U\s), (12.12) 

of all states |s), can be seen to be equivalent to a fixed coordinate transformation and 
hence, does not change any physical relationships. So, group representations that are 
unitarily related, as in definition 39, will be considered identical. 


Returning to equation 12.5, it is now evident that Us must be a finite dimensional 
matrix of the form Urb- The simplest form of Us would be the following cl dimensional 
matrix which has the form of Urb with 1 = 1' = /" = ... = 0. 


U s 


( Uro 

Ur 0 

Uro 


\ 





(12.13) 


URo(n, 0) is seen to be one dimensional and for all n and 6 (see problem 2) 


Uro{ n, 9) = 1. 


(12.14) 
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Hence, Us is an identity matrix and does not mix spinor components in a rotation oper¬ 
ation as given by equation 12.2. Physically this is equivalent to each spinor component 
representing a different scalar particle 1 . Correspondingly, the Ur o representation is called 
the scalar representation as it does not change the components of the wavefunction. It is 
sometimes also called the trivial representation. 

Similarly, any spin system that carries a reducible representation Us, can be split 
into independent systems each of which would carry an IRR of 50(3) that is included in 
the original Us- Hence, one needs to study only those spin systems that carry IRR’s of 
50(3). Accordingly, a spin system is named after the l value (angular momentum quantum 
number) of the IRR that it carries. For example, the l = 0 IRR is carried by systems with 
one spinor component (i.e. the kind we have been studying before this chapter) and they 
are called the spin-zero particles. The spin-half particles carry the l = 1/2 IRR and hence 
have d = (21 + 1) = 2. Similarly, spin-one particles have d = 3, and are sometimes called 
vector particles as usual three dimensional vectors also carry the l = 1 IRR. 


12.2 Spin-half particles 


The simplest nontrivial example of spin is spin-half, which has d = 2 and Us as the l = 1/2 
IRR of 50(3): 

U S = U R 1/2- (12.15) 

Some examples of spin-half particles are the electron, the proton and the neutron. The 
l = 1/2 representation must have 2 dimensional (i.e. 21 + 1) matrices for its generators 
(viz. Li x , Li y , Li z ). For the present case, these generators will be named S x . S y and S z 
respectively and they will be considered to be the components of the vector operator S. 
This is known as the spin operator. Using the standard definition of the angular momentum 
eigenstates | l,m), it can be seen that (see problem 1) 


S x 




(12.16) 


Here, to conform with standard notation for spin operators, the spin-up state (m = +1/2) 
is chosen to have the matrix index 1 and the spin-down state (m = —1/2) is chosen to have 
the matrix index 2 2 . The matrix parts of the above operators are called the Pauli spin 
matrices and they are written as a x , a y , and a z which are the components of the vector 
operator a. Hence, 

S = h -cr. (12.17) 


1 These are particles that have nondegenerate position eigenstates. 

2 Mathematically, the more natural choice would have been the smaller m value for the smaller index. 
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where 


(T rp - 


0 1 
1 0 


a y — 


0 -i 

i 0 


o, = 


1 0 
0 -1 


(12.18) 

The exponential form in equation 12.8, for l = 1/2, will now give Us for the present case. 

Us(n, 9) = exp(— iS ■ hO/h). (12.19) 

As an example, a rotation of 6 about the x axis is seen to be given by (see problem 3) 

U s {i,9) = exp(-iS x 0/h) 

( cos 9 —i sin 0 \ 
l —i sin 0 cos 6 J 

= cos 9 — ia x sin 0, (12.20) 


where a multiplication by the 2x2 identity matrix is implicit for the term that does not 
appear to be a matrix. The two spinor components of the wavefunction, as defined in 
equation 12.1, can be written as a column vector as follows. 

M.)=*.(D =(£$)• (12.21) 

These two-component column vectors are the spinors for the spin-half system. Then, from 
equation 12.2, the rotation operation on ip s (r) can be written as the following matrix 
product. 

(r\U R \s) = U s {a~ 1 r\s) = U s ^s{a~ l r) = U s ( ) ■ ( 12 ' 22 ) 

From the derivation of the rotation operation on a single component wavefunction (sec¬ 
tion 7.4), it can be seen that 

i/ s j(a _1 r) = exp(—iL • h6/h)i/; S i(r), (12.23) 

where L = R x P, is the usual spatial angular momentum operator. Hence, from equa¬ 
tions 12.19, 12.22 and 12.23 we obtain the rotation operation on a spinor to be given by 


(r|= exp[—i(L + S) • n0/7i](r|s). (12.24) 

Thus the rotation operator for such systems can be written as 

Ur( n, 6) = exp[—iJ • n 9/h], (12.25) 

where 

J = L + S. (12.26) 

The components of the vector operator J, are clearly seen to be the generators of rotation 
for spinors. Consequently, from corollary 7.1, one concludes that the components of J are 
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conserved in spherically symmetric systems. From a physical point of view this means that, 
in the absence of external torques, it is J, and not L, that is conserved. Hence, it is J, and 
not L, that must be the observable angular momentum. L, which is sometimes called the 
orbital angular momentum, has a classical analogue that is due to the rotational motion 
of the particle. S, which is sometimes called the spin angular momentum, has no classical 
analogue and is not related to spatial rotation[ 10]! Nonetheless, S is physically significant. 
A spin-half particle, with no orbital angular momentum, will give S as its observed angular 
momentum. For example, a stationary electron will still have an angular momentum. This 
spin angular momentum, in fact, can never vanish because the only possible eigenvalue of 
S 2 (= S • S) is 3h 2 /4. Similarly, the measured values of S x , S y and S z can only be their 
eigenvalues which are h/2 and —h/2. 

Let us now consider a free spin-half particle (e.g. an electron). The hamiltonian 

" = £• < 12 - 27) 

and all other spin independent operators are implicitly assumed to be multiplied by 2 x 2 
identity matrices to define their oparation on spinors. Thus we expect simultaneous eigen¬ 
states of the mutually commuting operator set {H, S 2 , S z . P x . P y . P z }. Using the represen¬ 
tation in the equations 12.16 for spin matrices, one finds the position representation of 
these simultaneous eigenstates to be as follows. 

V> + (r) = (2rt)-»/^ exp < ip o ' r /' l )L (12.28) 

= ( M )" 3/2 ( exp(s ". r/R) ) ■ ( 12 . 29 ) 

where the components of p are the eigenvalues of the corresponding components of P, 
the eigenvalue of H is p 2 /(2m) and the eigenvalue of S 2 is 31 i 2 /4. The eigenvalue of S z 
corresponding to is h/2 and corresponding to if}- is —h/2. 


12.3 Spin magnetic moment (Stern-Gerlach experiment) 


A direct experimental verification of the electron spin is seen with a rather simple setup 
(the well-known Stern-Gerlach experiment[13]). As shown in fig. 12.1, a beam of electrons 
(silver atoms were used in the original experiment) passing through a nonuniform magnetic 
field in the z direction, is seen to split according to the z component of spin of the electrons. 
Such a setup can also be used to demonstrate some of the peculiar properties of quantum 
systems[14], for example, the collapse of a state (postulate 5). This is done by first splitting 
an electron beam according to its two possible ^-components of spin. If the +1/2 spin beam 
is selected out (by blocking the other beam), the electrons in this beam will be in a +1/2 
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Figure 12.1: A Stern-Gerlach setup - an electron beam in the y direction is split in two 
along the z direction by a z direction magnetic field that has a strong uniform part and 
a weak nonuniform part. The nonuniformity is produced by shaping the pole pieces as 
shown. 


spin collapsed state. Hence, a second Stern-Gerlach setup for the 2 -conrponent of spin will 
not split this beam any further. However, if the second setup is for the x-component of spin 
(magnet rotated in the x direction), the beam would split in two once again corresponding 
to the two possible x-components of spin (see problem 7). Each of these beams can now 
be split into both a +1/2 and a —1/2 spin component in the z direction by using another 
z-component Stern-Gerlach setup! 

In this section, the spin magnetic moment will be discussed and a theoretical basis 
for the Stern-Gerlach experiment will be presented. 

A classical particle of mass m, charge q and angular momentum L, has a magenetic 
dipole moment 

M l = JLl. (12.30) 

2m 

For the equivalent quantum system the same relation must be true in an operator sense if 
L is an orbital angular momentum. For spin angular momenta such a relation cannot be 
expected to be true as spin has no classical analogue and is not due to a spatial rotation. 
However, a magenetic dipole moment M s , related to the spin S, is experimentally observed. 
The relation is seen to be 

M s = |1 S , (12.31) 

2m 

where g is a dimensionless constant that is found to be different for different particles. 
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For an electron g = 2. The relativistic quantum theory of the electron (Diracfl]) agrees 
with this experimental value for g (see chapter 13). The Dirac theory cannot explain the 
experimental values of the magnetic dipole moments of the proton or the neutron if they are 
assumed to be structureless particles. This was one of the original reasons for suspecting 
that the proton and the neutron have a substructure. Now it is known that this substructure 
is that of quarks. Quarks are spin-half particles and they have been found to have g = 2 
within experimental error as expected from Dirac’s theory. 

From electrodynamics it is known that a magnetic dipole, of moment M, placed in a 
magnetic field B, has the energy 

H m = -M • B. (12.32) 

This energy term was seen for the orbital angular momentum in our discussion of the 
Zeeman effect (equation 10.49). The electron in an atom has both M/, and M s and hence, 
its magnetic dipole energy is 


Hm = -( M l + M s )-B. (12.33) 

If the charge of the electron is given by —e, then from equations 12.30, 12.31, 12.33 and 
the fact that g = 2 for the electron, one obtains 

Hm = x—(L + 2S) • B. (12.34) 

2m 

Thus we see that the Zeeman effect computation of chapter 10 is not complete. We shall do 
the complete computation later. At present, we notice that equation 12.34 can be applied 
to the Stern-Gerlach setup. 

An electron influenced by no force other than a magenetic field, in a suitable coordinate 
system, has L = 0. Hence, from equation 12.34, in a magnetic field B, it has the magnetic 
energy 

H m = — S B. (12.35) 

m 

If B is a uniform magnetic field in the z direction with a magnitude B, then 


H m = ( eB/m)S z . 


(12.36) 


If this is added to the free particle hamiltonian of equation 12.27 it can be seen that ■f/ , + 
and are still the energy eigenstates but with different eigenvalues. The corresponding 
eigenvalues are 


E, = 


p 2 + eBh 


E _ = 


P‘ 


— eBh 


(12.37) 


2 m 2 m 

This separates the two spin states by energy, but the splitting of the beam as required 
by the Stern-Gerlach setup is still not achieved. To spatially separate the two spin states, 
one can typically have the electron beam in the y direction and include a small nonuniform 
magnetic field in the z direction. The nonuniform part of the field is kept small compared to 
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the uniform part to keep the x and y components of the magnetic field small and ignorable 3 . 
The x and y components must be kept small to make sure that their contribution to H m 
is small enough to consider ip+ and to be the approximate eigenstates. Now, electrons 
of the two spin states can be seen to develop a z component of momentum in opposite 
directions. From equation 4.19 the time rate of change of the expectation value of P z is 
seen to be 


&>- = 


ih 


(12.38) 


The subscript s for the expectation value denotes the state |s) of the system. Each electron 
in the beam is a system in itself and could be in either the state or the state If the 
expectation values in these states are represented as ()±, then equations 12.36 and 12.38 
give 


cl 


(Pz)± = =F 


eh clB 


(12.39) 


dt " 2m dz 

Hence, the two eigenstates develop small z components of momenta in opposite directions 
thus splitting the beam. It is interesting to observe that, although a nonuniform field must 
exist in the x or the y direction (in addition to the one in the z direction as V • B = 0), 
there is no beam splitting in those direction. This is because the strong uniform field in the 
z direction forces the energy eigenstates to be eigenstates of S z (the small nonuniform field 
will perturb this eigenstate only negligibly). If an electron trajectory were to bend in the x 
direction, it would have to be in an eigenstate of S x which would not be an energy eigenstate 
and as energy is being measured inadvertently, this would not be possible (postulate 5). 


12.4 Spin-orbit coupling 


The inclusion of electron spin will change the hamiltonian for the hydrogen atom as dis¬ 
cussed in chapter 8. In the hydrogen atom, from the point of view of the electron, it is the 
proton that orbits around it. If the velocity of the electron is v then it “sees” the proton 
to be moving with respect to itself at a velocity —v. From electrodynamics, it is seen that 
such a moving charge produces a magnetic field of 

B = E x v/c 2 , (12.40) 

where c is the speed of light and E is the electric field due to the charge: 

E = ^r. (12.41) 

3 A nonuniform 2 component of the field will always produce x or y components due to the Maxwell 
equation V ■ B = 0. 

4 Tliis statement needs a subtle explanation. Although ip± are the energy eigenstates (ignoring the 
small effects of the nonuniform field), the system is not expected to be in these states unless an energy 
measurement is made. Hence, it is important to realize that an “inadvertent” energy measurement occurs 
in this experiment - One can deduce the energy from the direction of bending of the beam! So, from 
postulate 5 the system must collapse to an energy eigenstate. 
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Here the eigenvalue r is used instead of the operator R as the position representation is 
the most convenient for these computations. Thus 


B = 


he 

c 2 r 3 


X V = 


k e e 

c 2 mr 3 


L. 


(12.42) 


Using this relation in equation 12.35 gives the magnetic energy of the spin dipole for the 
hydrogen atom: 


Ko = 


k P e 2 


c 2 m 2 r 3 


L S. 


(12.43) 


However, this energy term is not complete. Due to a relativistic effect called Thomas 
precession , the correct energy of the spin dipole is H' so /2. This result will be derived in 
chapter 13. For now, we shall use the following addition to the hydrogen atom hamiltonian. 


H so = f(r )L • S, 


(12.44) 


where 


f(r) 


he 2 

2 c 2 m 2 r 3 


(12.45) 


This additional term changes the energy spectrum of the hydrogen atom only slightly. 
Hence, the term fine structure is used for the spectral lines shifted as a result of this extra 
term. H so is sometimes called the spin-orbit coupling term due to its dependence on both 
the spin and the orbital angular momenta. This treatment of spin-orbit coupling can be 
generalized for other kinds of atoms - in particular alkali metal atoms. An alkali metal atom 
has one outer shell electron that is loosely bound to the rest of the atom and hence the rest 
of the atom can be approximated as a rigid spherical object with some charge distribution. 
This charge distribution produces an electric field that affects the outer electron in a fashion 
similar to the proton of the hydrogen atom. The actual electric field is, of course, different. 
But this is remedied by choosing a suitable function f{r) for each alkali metal. 


Now, for hydrogen and alkali metals the hamiltonian can be written as 


H — Hq + H so , 


(12.46) 


where 


*o = £Ur W . 


(12.47) 


As the effects of spin-orbit coupling are known to be small, one can use the degenerate 
perturbation method to find the corrections to the energy eigenvalues due to H so . To avoid 
complicated determinant computations it is desirable (if possible) to rewrite the degenerate 
eigenstates of Hq as some linear combinations such that they are also eigenstates of H so . 
This is seen to be possible by noticing that 


L • S = (J 2 — L 2 — S 2 )/2. 


(12.48) 
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Hence, J 2 , L 2 and S 2 commute with Hq and H so . So we write the angular part of the 
eigenstates of H 0 as the eigenstates \l,j,m) of the operators {L 2 , S 2 , J 2 , J z }. This is a 
special case of the addition of angular momenta as discussed in chapter 7. Here the two 
angular momenta being added are L and S and their sum is J. The quantum number 
corresponding to S 2 is omitted among the labels of the eigenstate because it is always 1/2. 
The eigenvalue equations for each of the operators define the standard meanings of the 
labels as follows. 


L 2 \l, j,m) 

= 1(1 + l)h 2 \l, j,m), 

(12.49) 

S 2 \l,j,m) 

= (3ft 2 / 4 ) 1 l,j,m), 

(12.50) 

J 2 \l,j,m) 

= j(j + l)h 2 \l,j,m), 

(12.51) 



(12.52) 


Using the Clebsch-Gordan coefficients the states \l,j,m) can be written as linear combina¬ 
tions of the states \l, mi, +) and 1 1, mi, —) that are eigenstates of the operators {L 2 , S 2 , L z , S~} 
The label for S 2 is once again omitted and the rest of the labels are defined by the following 
eigenvalue equations. 


L 2 \l,m h ±) 

= 1(1+ l)h 2 \l,mi,±), 

(12.53) 

S 2 \l,m u ±) 

= (3h 2 /4:)\l, mi, ±), 

(12.54) 

L z \l,m h ±) 

= mih\l,mi,±), 

(12.55) 

S z \l,m h ±) 

= F(n/2)\l,m h ±). 

(12.56) 


Hence, it is seen that j = l ± 1/2. Using the angular momentum eigenstates | l,j,m), 
for a degenerate perturbation computation using H so as the perturbation, one obtains the 
correction terms in energy to be 

A E nlj = F nl [j(j + 1) - 1(1 + 1) - 3/ 4]h 2 /2, (12.57) 

where 

r oo 

F n i= \R n i(r)\ 2 f(r)r 2 dr, (12.58) 

Jo 

and R n j are the radial parts of the corresponding energy eigenfunctions of Hq. 


12.5 Zeeman effect revisited 


In chapter 10 we had discussed the Zeeman effect for spinless electrons. Such electrons 
are, of course, not realistic. To achieve agreement with experiment one needs to use equa¬ 
tion 12.34 for the perturbing hamiltonian due to a uniform external magnetic field B: 

H m = (L + 2S) • B. 


(12.59) 
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Thus an alkali atom outer electron can be described by the complete hamiltonian 


H — Hq + H so + H m . (12.60) 

Now neither of the angular states | l,j, m) or 1 1, m/, ±) are eigenstates of both H so and H m . 
Hence, perturbation computations would require the diagonalization of matrices. However, 
for a weak external magnetic field H so H m and the \l,j, m) states are an appropriate 
choice. Similarly, for strong magnetic fields H so <C H m and the \l,mi,E) are the better 
choice for computations. The Clebsch-Gordan coefficients are used in either case to find 
the effect of an operator on a state which is not its eigenstate. The detailed computations 
will not be shown here (see problem 6). 


Problems 

1. From the definition in equation 12.7, find the matrices Li x , Li y , and L[ z for l = 
0,1/2, and 1. 

2. From the definition in equation 12.8 and the results of problem 1, find Um(n,0) for 
rotations about the x axis (i.e. n = i) for l = 0,1/2, and 1. 

3. Prove equation 12.20. 

4. Prove the following identities: 

(a) a'i = a 2 = a 2 = 1, 

(b) O x&y — ff y <J X — M z , 

(c) a y a z — ffz^y — iffxi 

(d) CT Z (J X - &X&Z - iffy. 

5. Find the spin-orbit interaction energies for the first excited states (n = 2) of the 
hydrogen atom. 

6. Find the weak field Zeeman effect corrections for the first excited state (n = 2) with 
j = 3/2 of the hydrogen atom. 

7. A beam of electrons in the +1/2 state for the ^-component of spin is travelling along 
the y direction. If a Stern-Gerlach setup for the x-conrponent of spin is placed in its 
path, what fraction of the electrons will be seen to have +1/2 spin in the x direction? 



Chapter 13 


Relativistic Quantum Mechanics 


At the end of the nineteenth century there were two major unexplained issues - the black- 
body radiation distribution and electromagnetic theory in the context of Galilean relativity. 
The explanation of the first led to quantum mechanics and that of the second led to special 
relativity. The independent development of the two subjects was quite satisfactory. How¬ 
ever, when a meeting of the two subjects became necessary (for the description of particles 
that travelled at speeds close to that of light), there were serious problems. 

One of the problems arose primarily due to the postulate of collapse of quantum states 
(postulate 5). Such a collapse presumably occurs instantaneously. But relativity does not 
allow any instantaneous movement. The speed of light is a speed limit. This contradiction 
is illustrated by the well known Einstein-Padolsky-Rosen (EPR) paradox. Originally this 
was proposed as a paradoxical gedanken experiment. But more recently, such experiments 
have actually been done [15]. The resolution of the paradox comes from the understanding 
that the special relativistic speed limit is restricted to the movement of information (or 
energy). 

The second problem is more serious. The description of interaction of several rela¬ 
tivistic particles is very tricky even in a classical (non-quantum) setup. In fact, at one time 
a theorem (no interaction theorem[16]) was proved which claimed that no interaction is 
possible between two relativistic particles unless they occupied the same space-time point! 
It was later shown that the conditions used for this proof were too stringent and physical 
reality did not require them. However, it still illustrates the difficulty of introducing inter¬ 
actions at a distance between relativistic particles. The source of this theoretical dilemma 
is the four-dimensional nature of relativistic position. For a relativistic description each 
particle position must include its individual time coordinate. For a physical measurement 
one knows that the time coordinate of all particles must be the same for a given system. 
So, constraints are required for the coordinates of a system of particles. These constraints 
themselves must be relativistically covariant to maintain general covariance. The choice of 
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such constraints is the sticking point. 

A solution to the problem of interaction at a distance is to avoid all interactions at a 
distance. This is done in quantum field theory (QFT). In QFT every interaction between 
particles is considered to be mediated by other particles. For example, electromagnetic 
interactions between electrons are mediated by photons (light particles). An electron pro¬ 
duces a photon (at its own space-time point) and transfers energy and momentum to it. 
The photon travels like a particle to another electron, deposits its energy and momentum 
to it and then disappears. QFT’s have been proposed for every kind of quantum interaction 
- electromagnetic, weak, and strong. However, only the electromagnetic case has had the 
most experimental success. This is called quantum electrodynamics (QED). 

A discussion of QED is beyond the scope of this book[18, 19]. Instead we shall 
discuss the simple case of the quantum mechanics of a single relativistic particle in a fixed 
background potential. This case does not have the problem of interaction at a distance as 
there is only one particle involved. The relativistic hydrogen atom can be approximated to 
be such a system as the proton can be approximated to be almost stationary in the frame 
of reference of the atom due to its significantly larger mass compared to the electron. The 
stationary proton produces the fixed background potential for the orbiting electron. 


13.1 The Klein-Gordon equation 


The intuitive approach to a relativistic quantum theory of a single particle would be to 
write a relativistic form the Schrodinger equation. The form of the Schrodinger equation 
presented in postulate 1 is general enough to include relativity. Just the hamiltonian H 
needs to be appropriately chosen. For the free particle, H could be chosen to be energy 
and formally related to momentum as in special relativity: 

H = ±QP 2 c 2 + m 2 c 4 , (13.1) 

where m is the rest mass of the particle and c the speed of light. The immediate problem 
of this choice is the sign ambiguity. A rather cavalier decision to drop the possibility of the 
negative square root can be made. However, that causes serious mathematical problems 
for the completeness of the eigenstates of the hamiltonian. The other choice is to operate 
the Schrodinger equation (equation 2.1) by H and then use itself a second time to give a 
sign unambiguous equation: 

02 

-h 2 — \s) = H 2 \s). (13.2) 

Using equations 13.1 and 13.2, one obtains the so-called Klein-Gordon equation for a free 
particle: 

I s ) = (P 2 c 2 +m 2 c i )\s). 


(13.3) 
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Using the position representation, this equation takes the following form. 



c 2 dt 2 J 


4 ', 


m 2 c 2 

-JF 


T, 


(13.4) 


For m = 0, this is exactly the linear wave equation for a wave that travels at the speed of 
light c. 


For the case of light, Ts is the relativistic four component vector potential (/j, = 
0,1,2,3) for electromagnetic fields. The zeroth component A 0 = 4>/c where $ is the 
standard scalar potential and the other three are the three components of the magnetic 
vector potential A. The correspondence is sometimes written as A 4 = (4>/c, A). The 
notation here is obvious and can also be used to depict the four component forms of other 
vectors. For example, the position four-vector is x ^ = (ct, r) where t is time and r is 
the usual position vector. A lower index form of these vectors is sometimes defined for 
convenience by changing the sign of the zeroth component 1 . For example, for position 
x M = (— ct, r). Besides brevity, the four-vector notation has another benefit. It makes it 
easier to keep track of relativistic consistency of equations - and all physical equations must 
be consistent with relativity. Such consistency is sometimes called covariance. A covariant 
equation, written in the four-vector form, does not change in form in different frames of 
reference. For example, equation 13.4 can be written as 


d 2 m 2 c 2 

dxi- l dx fl s h 2 Sl 


(13.5) 


where it is implicitly assumed that the left hand side is summed over all four values of //. 
This assumption will be made whenever a contravariant index and a covariant index are 
the same and appear in a product of components as in the above equation. This is called 
the Einstein summation convention and is used extensively to avoid repeated use of the 
summation sign. For the case of light the corresponding equation is 2 


d 2 

dx^dx^ 


A v = 0. 


(13.6) 


These are four equations for the four components of A v . It is interesting to note that, 
unlike the usual wavefunctions, the A v for light are actually measurable 3 ! 


It is also to be noted that A u has four components - somewhat like the two-component 
nature of spinors. This provides a hint about the spin of the light particle - the photon. It 

1 The upper index vector is called a contravariant vector and the lower index vector is called a covariant 
vector. Different conventions for the definitions of such vectors may be found in other texts. A more general 
approach to such vectors is used in the general theory of relativity. 

“The Lorentz gauge is chosen liere[17]. 

3 Of course, there is some ambiguity due to gauge choice and the true measurable quantities are the 
electric and magnetic field vectors. 



CHAPTER 13. RELATIVISTIC QUANTUM MECHANICS 


165 


can be shown that the photon is a spin-one particle[18]. It can also be shown that a particle 
with a scalar (relativistic) wavefunction has zero spin[18]. In fact, it is possible to write 
down Klein-Gordon equations for particles of any spin. However, it is seldom necessary 
to go beyond spin-one as all known fundamental particles have spins of one or less 4 . For 
spin-half particles, the wavefunction is a four component object, but it is not a four-vector. 
It is called a Dirac spinor. To understand the component structure of wavefunctions for 
different spins, one could look back at the two-component spinors (Pauli spinors) discussed 
in chapter 12. The Pauli spinors were seen to carry the two-dimensional IRR of the SO(3) 
group. In fact, it was seen that for every IRR of the SO(3) group, there exists a possible spin. 
An extension of the SO(3) group is the group of all possible special relativistic coordinate 
transformations (not considering translations). This is the so-called Lorentz group. Every 
IRR of the Lorentz group corresponds to a possible relativistic spin. The spin-zero IRR is 
one dimensional (the scalar wavefunction), the spin-half IRR is four-dimensional (the Dirac 
spinor) and the spin-one IRR is also four dimensional (relativistic four-vector). The Dirac 
equation (not the Klein-Gordon equation) happens to be the natural choice for spin-half 
particles and it will be discussed in the next section. 

For the free Klein-Gordon particle, the hamiltonian commutes with the momentum 
operator and hence, as expected, momentum is conserved. The momentum eigenstates |p) 
are also energy eigenstates and the energy-momentum relationship is found as follows. 

E|p) = H\p) = ±\/ P 2 c 2 + m 2 c 4 |p) = p 2 c 2 + m 2 <A\p), (13.7) 

where, as before, p is the eigenvalue of the momentum operator P. Hence, 

E = ±\Jp 2 c 2 + m 2 c A . (13.8) 

It is seen that the energy could be either positive or negative. It has a minimum positive 
value of me 2 and a maximum negative value of —me 2 . For free particles, negative energy 
is physically meaningless. However, the negative energy eigenvalues cannot be mathemati¬ 
cally ignored as they are needed for the completeness of the energy eigenstates. A similar 
dilemma for the Dirac equation had prompted Dirac to postulate what are now known 
as antiparticles. For every particle of energy E there exists an antiparticle with identical 
properties with energy —E. An antiparticle travels backward in time and hence, from the 
physical point of view of time translation (equation 7.23), it would appear to be a particle 
of positive energy travelling forward in time. This resolves the physical problem of nega¬ 
tive energy, but introduces the need to experimentally detect the postulated antiparticles. 
Since the original postulate by Dirac, antiparticles have been amply detected (for example 
antiprotons, antineutrons, antielectrons (or positrons) etc.). 

The energy-momentum relationship of equation 13.8, can also be written in a covariant 
four-vector form. The zeroth component of the momentum four-vector is related to energy. 

4 Gravitons, if they exist, must have spin-two. Sometimes some tightly bound composites like mesons, 
baryons or even atomic nucleii might be approximated to be single particles with higher spin values. 
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The four-vector momentum is p^ = (E/c, p). Then the covariant form of equation 13.8 is 

p tl p, x + m 2 c 2 = 0. (13.9) 

It is possible to find energy eigenvalues for Klein-Gordon particles in the presence of 
static background potentials. However, we shall not do it here as such problems are of 
little practical value. The hydrogen atom problem is that of an electron in a background 
potential. But the electron has a spin of half and hence, obeys the Dirac and not the 
Klein-Gordon equation. It is possible to have a pi meson (pion) orbit around a proton to 
simulate the Klein-Gordon situation. But this situation is further complicated by other 
possible interactions of a pion that can be dealt with only by a quantum field theory. The 
photon interacting with a static charge may be considered to be an interacting Klein-Gordon 
particle. But such a problem is nothing more than the case of electromagnetic fields in the 
presence of charge sources which is adequately dealt with in all electromagnetic theory 
texts. 


13.2 The Dirac equation 


The square root of an operator as shown in equation 13.1, is a mathematical complica¬ 
tion that was avoided by the Klein-Gordon equation by squaring the operator. Dirac[l] 
avoided the problem by choosing a hamiltonian that is linear in the momentum P. This 
choice turned out to be the correct one for spin-half particles like the electron. The Dirac 
hamiltonian for a free particle is as follows. 

H = col • P + /3mc 2 , (13.10) 

where a and [3 are constants yet to be determined. This is the simplest linear relationship 
possible as me 2 must be the rest energy of the free particle. Using H from equation 13.10 in 
equation 2.1 gives the so-called Dirac equation. Once again, H and P are seen to commute 
and hence, they have simulataneous eigenstates that can be labelled by the momentum 
eigenvalues p and written as |p). Then the energy eigenvalue equation will give: 

(E — ca ■ p — f3mc 2 ) |p) = 0. (13.11) 

This provides an energy-momentum relationship that does not resemble equation 13.8 in 
any way. But equation 13.8 is a relativistic kinematic equation for all free particles and 
hence, must be true in this case as well. The situation is saved by considering (3 and the 
components of a to be matrices of dimensionality greater than one and the state vectors 
to have a column vector nature of the same dimensionality. With this assumption, if we 
multiply equation 13.11 from the left by (E + ca • p + (3mc 2 ), we get: 

[E 2 - c 2 (a 2 p 2 + a\p\ + a 2 3 p 2 3 + (a±a 2 + a 2 a 1 )p 1 p 2 
+ (a 2 a 3 + a 3 a 2 )p 2 p 3 + (a 3 a l + a 3 a 3 )p 3 pi ) - m 2 c 4 /3 2 
- mc 3 ((aiP + (3ai)pi + (a 2 (3 + f3a 2 )p 2 + (a 3 P + f3a 3 )p 3 )] |p) = 


0, (13.12) 
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where 07, (i = 1, 2, 3) are the three components a x , a y and a z of a. Now, if equation 13.8 
were to be satisfied the following relations must be true. 


«■ = P 2 

OLiOij ~\~ OijOL{ 

ati(3 + flat 


1, for i = 1, 2, 3, 

0, for i j and i,j = 1, 2, 3, 
0, for i = 1, 2, 3. 


(13.13) 

(13.14) 

(13.15) 


As the hamiltonian must be hermitian, the four matrices a* and (3 must be hermitian. 
Then, from the above equations, it can be shown that these matrices must be traceless and 
their only possible eigenvalues are ±1 (see problem 1). Hence, their dimensionality must 
be even. For the simplest possible theory, one chooses the lowest possible dimensionality. 
However, a dimensionality of 2 does not work. This is because two-dimensional hermitian 
traceless matrices can have only three independent parameters: 


( c a — ib 
l a + ib —c 


(13.16) 


where a, b and c are real and a is an arbitrary two-dimensional hermitian traceless matrix. 
So, a can be written as a linear combination of the Pauli spin matrices defined in chapter 12: 


a = aa x + ba y + ca z . 


(13.17) 


Then, it can be shown that four such matrices (07 and /3) ca nn ot be found such that 
they satisfy the conditions of equations 13.14 and 13.15 (see problem 2). Hence, we pick 
the next simplest choice for the dimensionality - namely 4. For four-dimensional matrices, 
there are an infinite number of possibilities that will satisfy the conditions of equations 13.14 
and 13.15. However, all such possibilities can be seen to produce the same physical results 
(namely, the eigenvalues of observables). Hence, we shall choose one convenient form of 
these matrices: 





0 (j 2 \ 

<Tz 0 )’ 


P 


1 0 
0 -1 


(13.18) 


where each entry in the matrices is a 2 x 2 matrix - 0 represents the zero matrix, 1 represents 
the identity matrix and the others are the standard Pauli spin matrices (equation 12.18). 
A more compact form is given by the following vector notation: 


a = 


0 (7 \ 

cr 0 ) ’ 


P 



(13.19) 


Now that the hamiltonian operator has a 4 x 4 matrix aspect to it, the state vector 
must be a corresponding 4-component object. Such a state vector is called a Dirac spinor. 
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Accordingly, |p) must be a 4-component column vector. As this is still a momentum 
eigenstate, its position representation can be written as: 


( r |P> 


I Ul \ 


U 2 

Vi 


exp ( ip • r/h). 


\ / 


(13.20) 


Inserting this in the energy eigenvalue equation (equation 13.11) and using equation 13.18, 
one obtains the following matrix equation for the tq. 


( (E — me 2 ) 

0 

-cp 3 

-c(pi - ip 2 ) \ 

( Ul \ 

0 

(E — me 2 ) 

c(jPi + W 2 ) 

cp:i 

U 2 

-CP3 

-c(pi - ip 2 ) 

(E + me 2 ) 

0 

Vi 

V -c(pi + i'P 2 ) 

CP3 

0 

(E + me 2 ) J 

V U 4 J 


These are a set of homogeneous equations in the tq and hence, for a non-zero solution to 
exist, the determinant of the matrix must vanish. This condition leads to the following 
solutions for the energy eigenvalue: 


E + 


+ \J c 2 p 2 + m 2 c 4: , E_ = — \J c 2 p 2 + m 2 c 4 . 


(13.22) 


Equation 13.21 also gives two possible eigenstates for each of these eigenvalues. Each 
eigenstate can be written as the column vector formed by the four components tq. For E + 
the eigenstates are: 


»+'l = 


( 

1 

\ 


( 

0 

\ 


0 




i 



C P3 


, U + _ = 


c(pi-ip2) 



E^+mV 




E++mc 2 


V 

c(pi+ip2) 

J 


V 

-cp 3 

) 

E++mc 2 


E+-\-mc 2 


For E _ the eigenstates are: 



( cp 3 \ 

E~ —me 2 


/ c(pi-ip2) \ 
1 E- —me 2 


c(p\+ip2) 


-Cp3 

U-+ = 

E- —me 2 

, = 

E-—mc 2 


1 


0 


VO J \ 1 ) 


(13.23) 


(13.24) 


The first subscript for the eigenstate gives the sign of the energy and the second gives 
the sign of the ^-component of its spin. The relation between particle spin and the Dirac 
spinor will be discussed later. For now, we notice that the negative energy states cannot 
be avoided even by a hamiltonian linear in momentum. So, once again, they are to be 
explained as antiparticles as discussed in the case of Klein-Gordon particles. 


It is to be noted that the relativistic covariance of the Dirac equation is imposed 
through the conditions in equations 13.13, 13.14 and 13.15. A more manifestly covariant 
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form of presenting all relevant equations is possible. For example, equation 13.11 could be 
multiplied from the left by —/3/c to obtain 

(7^ + mc ) Ip) = (13.25) 

where 

7 „ = (-/?,/?<*), (13.26) 

and, as before, = (E/c, p). However, in this text, we shall not use this notation. It is too 
compact for an introductory discussion. Once the student becomes reasonably comfortable 
with manipulations of the standard Dirac matrices a and /?, he/she can use the more 
compact and manifestly covariant formulation. 


13.3 Spin and the Dirac particle 


If a Dirac particle is placed in a spherically symmetric potential V, its angular momentum 
must be conserved (see chapter 7). The relevant hamiltonian would be 

H = ca- P + pmc 2 + V. (13.27) 

To be conserved, the angular momentum operator must commute with the hamiltonian. 
The orbital angular momentum L = R x P, by itself, does not commute with H. This 
is seen by first noticing that [L, V] = 0 (see problem 3) and then computing [L, if]. For 
example, for the x component 


[f-^jff] \L x ,col • P] ihc(a y P z cx z Py )• 
So, in general, for all three components: 

[L, if] = ihcoc x P. 


(13.28) 


(13.29) 


Hence, quite clearly, L is not the complete angular momentum of the particle, although it 
must be part of it. The remaining part of the total angular momentum can be seen to be 


where 



Now, if the total angular momentum is defined as 


(13.30) 

(13.31) 


J = L + S 


(13.32) 
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it is straightforward to show that (see problem 4) 

[J, H] = 0, (13.33) 

and hence, J must be the conserved total angular momentum. S is the spin angular 
momentum. The block diagonal form of S shows that the first two and the last two 
components of the Dirac spinor see the effect of S in pairs exactly the same way as the 
Pauli spinors see the effect of the two-dimensional spin operator (see chapter 12). Thus it is 
very satisfying to see spin angular momentum naturally built into the Dirac hamiltonian. In 
the following, it will be seen that the Dirac hamiltonian also includes the correct spin-orbit 
coupling term and the correct magnetic moment of the electron. 


13.4 Spin-orbit coupling in the Dirac hamiltonian 


As discussed in chapter 12, a nonrelativistic analysis misses a factor of half in spin-orbit 
coupling. We shall now see that this extra factor, due to Thomas precession, is built into the 
Dirac hamiltonian. To recognize the spin orbit term, as seen in chapter 12, we need to see 
a nonrelativistic approximation of the Dirac energy eigenvalue problem with a spherically 
symmetric potential. The eigenvalue equation is 

E\E) = (ca • P + /3mc 2 + V)\E), (13.34) 

where the hamiltonian from equation 13.27 is used with the standard notation | E) for 
the energy eigenstate. Unlike in the free particle case, | E) is not the same as |p). In a 
nonrelativistic limit, the first two and the last two components of the Dirac spinor | E) will 
be seen to decouple. So, for convenience, we shall write 


I E) = 



(13.35) 


where V\ and v? are two-component Pauli spinors. First, let us assume | E) to be a particle 
state (not antiparticle). Then E must be positive. In a nonrelativistic limit, the rest mass 
energy would constitute most of the energy. The quantity that was considered as energy 
in our earlier nonrelativistic work did not include the rest mass energy. That quantity will 
now be called E': 

E' = E — me 2 . (13.36) 

So, E' <C me 2 . Now, equation 13.34 can be written as a pair of Pauli spinor equations 
using equations 13.35 and 13.36: 


(. E' — V)v\ — ccr • Pi>2 = 
(E' + 2 me 2 — V)v2 — ccr • Pr>i = 


0 , 

0 . 


(13.37) 

(13.38) 
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The second of this pair of equations shows that, in the nonrelativistic limit, v 2 is smaller 
than V\ by a factor of the order of v/c where v is the speed of the particle 5 . For antiparticles, 
it is the reverse (see problem 7). So, for particles, V 2 is eliminated from equation 13.37 by 
using equation 13.38. The result is 

E ' v ' = ' p > (‘ + w) ' *>■ + ^ < 13 - 39) 


The actual approximation is given by the following: 

/ E'-V \- 1 ~ _ E' -V 

\ ^ 2 me 2 ) 2me 2 


(13.40) 


The approximation condition is (E' — V) <C 2 me 2 . To reduce equation 13.39 to a form simi¬ 
lar to the standard nonrelativistic equation, the following identities are used (see problems 5 
and 6). 


PV = VP-ihW, 

(cr • VV)(tr • P) = (W) -P + ia- [(W) X P], 


(13.41) 

(13.42) 


Now, using equations 13.40, 13.41 and 13.42 in equation 13.39, we obtain: 


E'v 1 = 


E’ ~V\ P 2 ' 

2 me 2 ) 2m 


ih 


V\ - 


4 m 2 c 2 


(W)-Pui + 


h 


4 m 2 c 2 


cr • [(VV) X P] '(q . (13.43) 


As P 2 /(4m 2 c 2 ) is already small of the order of v 2 /c 2 , the (E' — V ) can be further approxi¬ 
mated to be P 2 /{2m). Also, for a spherically symmetric potential 


W 


RdR ’ 


(13.44) 


where R = \/R • R. So now, equation 13.43 can be written as 


E'v PA V - iH 

1 y2 m 8 m 3 c 2 P 4 m 2 c 2 


(W) - P + 


1 dV 


2m 2 c 2 R dR 


S-L ui, 


(13.45) 


where S = ha/2 is the Pauli spinor form of the spin operator and L = R x P is the orbital 
angular momentum operator. The last term in equation 13.45 is seen to be the correct 
spin-orbit coupling term as discussed in chapter 12. The first and third terms are the 
standard nonrelativistic hamiltonian terms and the remaining two terms have no simple 
nonrelativistic interpretation. 

5 Consider the P operator to produce a factor of the order of mv where v is the velocity. 
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13.5 The Dirac hydrogen atom 

The nonrelativistic approximation of the last section is useful to have for general spherically 
symmetric potentials. Such potentials are good approximations for alkali atoms where the 
single outer shell electron can be treated as a single particle in the spherically symmetric 
background potential of the nucleus and the other filled shells of electrons. However, for 
the specific case of the hydrogen atom, the potential is simple and the energy eigenvalue 
problem can be solved exactly [8]. 

In doing this, we shall first separate the angular and the radial parts of the Dirac 
hamiltonian as given in equation 13.27. The radial component of momentum in classical 
physics is written as p r, where r is the unit vector in the radial direction. For the quantum 
analog, it might be tempting to just replace p and r by their corresponding operators. 
However, such a representation of the radial momentum can be seen to be nonhermitian 
due to the noncommuting nature of position and momentum (see problem 8). In general, 
to obtain a quantum analog of a product of classical observables, one picks the hermitian 
part. For example, for two hermitian operators A and B , the hermitian part of AB is 
(AB + BA)/ 2 and the antihermitian part is (AB — BA)/2 (see problem 9). So the radial 
momentum would be 

p -=K p 'l + ?' p )’ <i3 - 46) 

where R = \/R • R. This can be simplified as (see problem 10) 

p r = -3-(R • p - ih). (13.47) 

K 

The radial component of a has a simpler form as it commutes with R: 

a r = a • R/ R. (13.48) 

For a spherically symmetric potential the angular part of the Dirac hamiltonian must be 
contained in the ca • P term. To isolate this angular part, we subtract out the radial part 
ca r P r . So, the angular part of the hamiltonian is 

H a = c(a ■ P — a r P r ). (13.49) 

To find the relationship of H a and P r , we notice that a/. = 1 and hence, 

a r H a = c(R~ 1 a ■ Ret • P — P r ). (13.50) 

The first term on the right hand side can be simplified by using the following identity (see 
problem 11): 

a Aa B = A B + icF (A x B), (13.51) 

where A and B are two arbitrary vectors with no matrix nature relating to a. Hence, 
equation 13.50 reduces to 


a r H a = cR 1 i(cr' • L + h). 


(13.52) 
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where L = R x P. Multiplying both sides by a r gives 

H a = cR~ l ia r {a' • L + h). (13.53) 

The angular part of this is in the expression within the parenthesis. Let us call it HK': 

UK’ = cr' ■ L + fr. (13.54) 

If K’ were to commute with H, we could find their simultaneous eigenstates and replace 
K 1 by its eigenvalue in equation 13.53. This will be seen to make the energy eigenvalue 
problem a differential equation in the radial coordinate alone. However, to form such a 
conserved quantity K 1 must be multiplied by (3. Hence, we define the conserved quantity 
K (see problem 12): 

K = plC = p(a' • L /h + 1). (13.55) 

As P 2 = 1, we can now write 

H a = cR~ 1 iha r pK. (13.56) 

Now, using equations 13.27 13.49, and 13.56, we obtain 

H = ca r P r + cR~ l iha r pK + Pmc 2 + V. (13.57) 

Then, the energy eigenvalue problem can be written as 

E\E) = ( ca r P r + cR~ l iha r pk + Pmc 2 + V)\E), (13.58) 

where k is the eigenvalue of K. and | E) represents simultaneous eigenstates of H and K. 
To reduce the equation to a differential equation we use the position representation and 
the spherical polar coordinates. The position representation of P r in polar coordinates can 
be found to be (see problem 13): 

Pr = (Wr + F) ' <13 - 59) 

The only remaining matrix behavior is from a r and p. These two matrices commute with 
everything other than each other. The necessary relationships of a r and p are: 

a r P + Pa r = 0, a 2 =p 2 = l. (13.60) 

As seen in section 13.2, these are the only relations that are necessary to maintain the 
physical correctness of the Dirac equation. The actual form of the matrices can be picked 
for computational convenience. In the present case such a form would be: 



(13.61) 
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where each element is implicitly multiplied by a 2 x 2 identity matrix. With a similar un¬ 
derstanding, the position representation of the Dirac spinor can be written as the following 
object: 


(v\E) 


F/r \ 
G/r ) ’ 


(13.62) 


where F and G are functions of the radial coordinate r alone. The 1/r term is separated 
out to make the differential equations a little more compact. Using equations 13.59, 13.61 
and 13.62, the energy eigenvalue problem of equation 13.58 can be written in the position 
representation to be: 


(E -me 2 -V)F + Hc— + - G = 0, 

dr r 

(E + mc 2 -V)G-hc C ^ + —F = 0. 

dr r 


(13.63) 

(13.64) 


These are a pair of coupled first order ordinary differential equations. To find solutions, it 
is, once again, convenient to define a dimensionless independent variable: 


p = 8 r. 


(13.65) 


Inserting this in equations 13.63 and 13.64, and requiring p to be dimensionless in the 
simplest possible way, gives 


8 = +\Z8i8 2 , Si = 


me 2 + E 
he ’ 


82 = 


me 


E 


he 


Now, the equations 13.63 and 13.64 can be written as 

'& 


d k 

dp p 

d k 


G- 


i + N) f - 

8 hc8 


F = 0, 


dp p 


— \g = 0 


V 


hc8 


(13.66) 


(13.67) 

(13.68) 


For the specific case of the hydrogen atom the spherically symmetric potential is V = 
—k e e 2 /r , where —e is the electron charge and k e = l/(47reo) in the usual SI units. Now, a 
convenient dimensionless parameter can be defined as: 


a = ^-. (13.69) 

he 

This is the so-called fine structure constant that appears in the fine structure splitting terms 
of atomic spectra. It is roughly equal to 1/137. The smallness of this number is critical to 
many power series approximation methods used in quantum physics (in particular quantum 
electrodynamics). For the nonrelativistic hydrogen atom (and the harmonic oscillator) a 
standard method was used to separate the large distance behavior of the eigenfunctions. 
The same method can be used here as well to separate the two functions F and G as: 


F(p) = f(p) exp(-p), G(p) = g{p) exp(-p). 


(13.70) 
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Using equations 13.69 and 13.70 in the two eqautions 13.67 and 13.68, we obtain: 

(13.71) 

(13.72) 

A standard power series solution may be assumed for / and g: 

oo oo 

f = P s ^2 a iP\ g = p s 'STb i p\ (13.73) 

i =0 i =0 

where 7 ^ 0 and ao 7 ^ 0. Inserting this into the two differential equations for / and g and 
collecting terms of the same powers of p gives the recursion relations: 

8 2 

(s + i + k)bi - 6 *_i + aai — r a*_ 1 = 0 , 

0 

£ 

(•s + i - k)ai - a t -\ - abi - -jh-i = 0, 

for i > 0. The lowest power terms give the following equations. 

(s + k)bo + aciQ = 0 , 

(s — k)cio — abo = 0 . 

A non-zero solution for clq and bo can exist only if 

s = ±Vk 2 -a 2 . (13.78) 

It will soon be seen that k 2 > 1. Hence, to keep / and g from going to infinity at the origin, 
we must choose the positive value for s. With this choice, a relationship of ciq and bo can 
be found. For the other coefficients, one can find from equations 13.74 and 13.75 that 


(13.76) 

(13.77) 


(13.74) 

(13.75) 



bi[ 6 (s + i + k ) + 82 a] = di[ 82 (s + i — k ) — 8 a\. (13.79) 

Using this back in the the same two equations, gives a decoupled pair of recursion relations. 
What we need to see from such equations is the behavior of a* and bi for large i: 

2 2 

ci{ — —cii—h bi — —i- (13.80) 

1 1 

This shows that both series behave as exp(2p) for large p. Hence, to keep the eigenfunctions 
from going to infinity at large distances, the series must terminate. Fortuitously, this is 
seen to happen to both series with just one condition. If both series were to terminate at 
i = n! such that a n i + \ = b n > + i = 0 , then it is seen that all subsequent terms in the series 
also vanish. For different values of n’ we obtain different eigenvalues and eigenfunctions. 
By using either of the equations 13.74 and 13.75 for n’ = i — 1 we get 


b ! 2^n' — 8b n >, 


n' = 0,1,2 ,.... 


(13.81) 
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Using equation 13.79 for i = n' along with this gives 

2 8(s + n') = a{8\ - 8 2 ). 


(13.82) 


Writing 8, 8± and 8 2 in terms of the energy E (equation 13.66), and solving for E shows 
that it is positive and given by 


E = me 2 


1 + 


a 


(s + n') 2 


-1/2 


(13.83) 


That E is positive shows that positrons (electron antiparticles) cannot form bound states 
with a proton potential 6 . 


In equation 13.83, s depends on k. So, we need to find the possible values of k. 
From equation 13.55, it is seen that K 2 is related to the magnitude of the total angular 
momentum as follows. 


K 2 = h- 2 [(a' -L) 2 + 2ha' -L + h 2 } 

= h~ 2 [(L 2 + 2S-L + H 2 } 

= n~ 2 [(L + s) 2 + n 2 /4] 

= n~ 2 [j 2 + n 2 /4\, (13.84) 


where the result of problem 6 is used with the knowledge that L x L = iKL. As the 
eigenvalues of J 2 are j(j + 1) (j = 1/2,1, 3/2, 2,...), the eigenvalues K 2 are seen to be 

k = ± 1,±2,±3,.... (13.85) 


Although we have found only the values of k 2 , both positive and negative roots for k are 
considered eigenvalues. This is because the form of the operator K shows that both signs 
are equally likely for its eigenvalues. 


Equation 13.83 agrees very well with experiment including the fine structure splitting 
of energy levels. To see its connection with nonrelativistic results one may expand it in 
powers of a 2 (remember the a 2 dependence of .s). Keeping terms of upto order a 4 , this 
gives 


E = me 2 1 



a 4 / n 
2n 4 \\k\ 



(13.86) 


where n = n’ + \k\. The second term gives the usual nonrelativistic energy levels and the 
third term gives the fine structure splitting. 


6 Note that E includes the rest mass energy and hence cannot be negative for electrons, although E < me 2 . 
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13.6 The Dirac particle in a magnetic field 

The effect of a magnetic field is introduced in the Dirac hamiltonian in a manner similar 
to classical mechanics. The momentum P is replaced by P — qA where q is the particle 


charge and A is the magnetic vector potential. So, the hamiltonian becomes 

H = col • (P — qA) + /3mc 2 . (13.87) 

To isolate and recognize the spin magnetic moment term, we need to find the nonrelativistic 
limit. This is done by squaring the hamiltonian to get 

H 2 = c 2 [ol • (P — qA)} 2 + m 2 c 4 , (13.88) 

where the identities in equations 13.13 and 13.15 have been used. Equation 13.51 gives 

[a • (P - qA)} 2 = (P - qA) 2 + icr' ■ [(P - qA) x (P - qA)}. (13.89) 

Using the result of problem 5 it is seen that 

(P — qA) x (P - qA) = —q(A x P + P x A) = ihq'V x A = itiqH. (13.90) 

where B is the magnetic field. Using equations 13.89 and 13.90 in equation 13.88 gives 

H 2 = c 2 (P — qA) 2 — hqc 2 cr' ■ B + m 2 c A . (13.91) 

Hence, the energy eigenvalue equation is 

E 2 \E) = [c 2 (P-qA) 2 -hqc 2 a' -B + m 2 c A ]\E). (13.92) 


For positive particle energies, we can once again write E = E’ T me 2 , where E 1 is the energy 
as defined in the nonrelativistic limit. As E 1 <C me 2 in the nonrelativistic limit, one may 
approximate equation 13.92 to be 

^ | E) = [c 2 (P — qA) 2 — hqc 2 a' ■ B + m 2 c A ]\E). (13.93) 

This leads to 

E’\E) = [—(P - qA) 2 - — S • b1 | E). (13.94) 

|_2m m J 

where S is the spin operator. The interaction term of spin and magnetic field gives the 
experimentally correct expression for the magnetic dipole moment of an electron (see chap¬ 
ter 12). 
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Problems 

1. Show that, in order to satisfy equations 13.13, 13.14 and 13.15, the matrices a* 
and (3 must be traceless and their only possible eigenvalues must be ±1. [Hint: 
Show, for example, /3 = Oi\Oi\(3 = — a\(3ai and use the cyclic property of the trace: 
Tr(a/? 7 ) = TtQ/u(3).\ 

2. Show that the matrices a* and (3 ca nn ot satisfy the equations 13.14 and 13.15 if they 
are two-dimensional. [Hint: Use the general form given by equation 13.17.] 

3. A spherically symmetric scalar potential V is a function of R • R alone. Show that 
all components of the angular momentum L commute with V, that is 

[L, V] = 0. 

[Hint: Assume V to be a power series in R • R. A Taylor series expansion about a 
suitable origin is used to avoid negative powers.] 

4. Prove that the total angular momentum J of the Dirac particle is conserved for a 
spherically symmetric potential. 

5. If V is a function of position alone, show that 

[P, V] = —ih'VV. 

[Hint: Assume V to be a power series in each of the three position coordinates. A 
Taylor series expansion about a suitable origin is used to avoid negative powers.] 

6. If A and B are two vectors with no matrix nature related to spin matrices, then show 
that 

(er • A)(er • B) = A • B + ia ■ (A x B). 

[Hint: Use the results of problem 4 of chapter 12.] 

7. Find the nonrelativistic limit of equation 13.34 for antiparticle states. 

8. Show that the following operator analog of the radial momentum is nonhermitian. 

r . * 

Vb~r 

9. Show that (AB + BA )/2 is hermitian and (AB — BA)/2 is antihermitian if A and B 
are hermitian operators. [Definition: An operator C is antihermitian if C = —CQ\ 

10. Using equation 13.46, show that 

P r = — (R • P — ih). 

R 

[Hint: Operate on an arbitrary state in the position representation.] 
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11. Prove the identity in equation 13.51. 

12. Prove the following relations: 

(a) [a r ,K\= 0, 

(b) %K\ = 0, 

(c) [P r ,K\= 0, 

(d) [a r ,P r \= 0. 


13. Prove that in spherical polar coordinates the position representation of P r is 




Appendix A 


‘C’ Programs for Assorted 
Problems 


The following ‘C’ programs are not particularly “user-friendly” or “robust”. They are 
presented in this form so that students can easily identify the essential components of the 
numerical methods involved. The code is meant to be used in conjunction with the material 
in the text. Definitions of parameters must be understood before using the programs. 
Comments are provided to help do this. 


A.l Program for the solution of energy eigenvalues for the 
rectangular potential well 


#include <stdio.h> 
#include <math.h> 


void mainO 

{ 

float mid, lhs, rhs, inter, acc, gam, xi; 
int n; 

printf("\n Enter potential parameter gammma\n\n"); 

scanf ("”/ 0 f" ,&gam); 

printf("\n enter accuracy \n\n"); 

scanf ("°/ 0 f" ,&acc); 
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n=0; 
xi =0; 
inter=PI/2; 
while (xi <= gam) 

{ 

while (inter > acc) 

{ 

inter /= 2; 
mid = xi + inter; 
if (mid < gam) 

{ 

lhs = mid*tan(mid); 

rhs = sqrt(gam*gam - mid*mid); 

if (lhs < rhs) xi = mid; 

} 

} 

printf("\n The °/ 0 d th root for xi is °/ 0 f\n", n, xi) ; 
n++; 

xi = n*PI; 
inter = PI/2; 

} 

} 


A.2 General Program for one dimensional scattering off ar¬ 
bitrary barrier 


#include <stdio.h> 

#include <math.h> 

double ul,u,uu,vl,v,vu; 

void main() 

{ 

double e,vt,del,ki,kt; 
double din,ud,vd,ref,tran; 
extern void scatterO ; 

/* scatterO defines the specific potential to be used. See next 
listing for the case of rectangular barrier. */ 
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printf("enter data\n\n e,vt,del\n\n"); 

/* Choose units such that sqrt(2m)/hbar = 1. */ 

/* ‘e’ is energy of incoming particle in above units. */ 

/* ‘vt’ is potential energy in scattered region in above units. */ 

/* 'del’ is interval in computation of dimensionless variable y of text. */ 

scanf("°/ 0 lf 7.If # /.lf" ,&e,&vt,&del); 


ki = sqrt(e); 
kt = sqrt(e-vt); 

ul = 1; u = 1; 

vl = 0; v = -del*kt/ki; 

scatter(e,del); 

/* scatterO defines the specific potential to be used. See next 
listing for the case of rectangular barrier.*/ 

ud = (ul-u)/del; vd = (vl-v)/del; 
din = (u+vd)*(u+vd) + (ud-v)*(ud-v); 

printf ("\n # / 0 lf # / 0 lf */ 0 lf */ 0 lf # / 0 lf °/ 0 lf 7„lf %lf\n" ,ul,u,uu,ud,vl,v,vu,vd); 

ref = ((u-vd)*(u-vd) + (ud+v)*(ud+v))/din; 
tran = 4*kt/(ki*din); 

printf ("\n\n reflection coeff. = %lf \n\n 
transmission coeff. = %lf \n",ref,tran); 

} 


A.3 Function for rectangular barrier potential 


#include <stdio.h> 
#include <math.h> 


extern double ul,u,uu,vl,v,vu; 
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void scatter(e,del) 
double e,del; 

{ 

double r,pot,rf; 

printf("\n enter barrier height and width (dimensionless) \n"); 

/* Units are discussed in calling program */ 

scanf("°/ 0 lf %lf",&pot,&rf); 

for (r = -del; r > -rf; r -= del) 

{ 

uu = ((pot/e - l)*del*del + 2)*u - ul; 
vu = ((pot/e - l)*del*del + 2)*v - vl; 
ul = u; u = uu; 
vl = v; v = vu; 

} 

} 


A.4 General energy eigenvalue search program 


#include <stdio.h> 

#include <math.h> 

void main() 

{ 

double de,dem,be,del,e,e1,f2nl,f2n; 

int ne,k,j; 

extern double diff(); 

/* diff() defines the specific potential to be used. See next two 
listings for examples. */ 

double zeroinO ; 

FILE *fopenQ , *fp; 


printf("enter data\n\n de,dem,be,ne,k,del\n"); 
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/* Data parameters are defined for equations written with a dimensionless 
position variable ‘r’ (different symbols used for different problems 
in text), ‘de’ is the energy interval for rough linear search. 

‘dem’ is the tolerable error for energy eigenvalues, ‘be’ is the lower 
starting point for energy search, ‘ne’ is the number of 
eigenvalues to be computed, ‘k’ is used with different meanings 
for different potentials. */ 

scanf('7„lf °/„lf 7„lf °/ 0 d %lf %d %lf ’/.If", 

&de,&dem,&be,&ne,&k,&del); 
if((fp=fopen("outdat","r+")) == NULL) 
if((fp=fopen("outdat","w")) == NULL) 
printf("\n error opening outdat\n"); 
if(fseek(fp,0,2)) 
clrerr(fp); 

fprintf(fp,"\n parameters de,dem,be,ne,k,del are \n"); 
fprintf(fp," %f %f %f °/.d °/ 0 f %d °/ 0 f %f",de,dem,be,ne,k,del); 

e=be; f2nl=0; 

f or(j =0;j <ne;j ++) 

{ 

f2n=diff(e,k,del); 

/* diff() defines the specific potential to be used. See next two 
listings for examples. */ 

if(f2nl*f2n<0) 

{ 

el=zeroin(e,de,dem,f2n,k,del); 
printf("\n °/ 0 f %f %f \n", el ,f 2nl ,f 2n); 
fprintf (fp, "\n °/ 0 f %f %f \n", el ,f 2nl ,f 2n); 

} 

f2nl=f2n; e=e+de; 

} 

fclose(fp); 

} 

double 

zeroin(e,de,dem,f2n,k,del) 


double e,de,dem,f 2 n,del; 
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int k; 

{ 

double del,dell,em,ff; 
extern double diff(); 

del=de; 
dell=del; 
while(del>dem) 

{ 

del=del/2; em=e-del; 
dell=dell/2; 
ff=diff(em,k,dell); 
if(ff*f2n>=0.0) 

{ 

e=em; 

em=em-del; 
ff=diff(em,k,dell); 
if(ff*f2n>=0.0) e=em; 

> 

else 

{ 

ff=diff(em,k,dell); 
if(ff*f2n<0.0) e=e+del; 
} 

} 

return(e); 

} 


A.5 Function for the harmonic oscillator potential 


#include <stdio.h> 

#include <math.h> 

double 

diff(e,k,del) 

/* ‘k ; is zero for even wavefunctions and non-zero for odd wavefunctions. 
The odd wavefunction is handled somewhat differently from the text 
for better accuracy. It is taken as an even function multiplied by ‘r’ 
and the even function is then computed numerically. Energy search 
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can be started at zero (the value for ‘be’)- */ 

double e,del; 
int k; 

{ 

double vl,vu,v,r; 

vl = 1; 
v = 1; 
r = del; 

if (k == 0) 

while(abs(v)<10) 

{ 

vu = v*((r*r-e)*del*del +2) - vl; 

r=r+del; 

vl = v; v = vu; 

} 

else 

while(abs(v)<10) 

{ 

vu = v*((r*r-e)*del*del + 2) + vl*(del/r-l); 
vu = vu/(1+del/r); 
r = r+del; 
vl = v; v = vu; 

} 

return(v); 

} 


A.6 Function for the hydrogen atom potential 


#include <stdio.h> 
#include <math.h> 
#define TRUE 1 
#define FALSE 0 


double 

diff(e,k,del) 
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/* ‘k’ is the total angular momentum quantum number ‘1’. Here ‘e’ is not 
quite energy. But it is related to energy. It is the parameter defined 
in the text as the greek letter ‘beta’. One may use ‘be’ as zero for 
this case too. Although, energy eigenvalues are negative and the 
ground state cannot be easily estimated, ‘beta’ can still be found 
to have a lower bound of zero. */ 

double e, del; 
int k; 

{ 

int tail=FALSE; 

register double vl,v,r,vu,ab; 

double dif; 

register double small,large; 
double verysmall; 
small = del/10; 
verysmall = small/10; 
vl = 1; 

v = 1 - e*del/(2*k+2); 
r = del; 
large = 5*vl; 
ab = abs(v); 

while(ab<large && !tail) 

{ 

do 

{ 

vu = ((2+(0.25-e/r)*del*del)*v + ((k+l)*del/r-l)*vl)/(l+(k+l)*del/r); 
r += del; 
vl = v; v = vu; 
ab = abs(v); 

} 

while (ab>small && ab<large); 
dif = abs(v-vl); 

if (dif<verysmall) { large = del; tail = TRUE; } 

} 

if (tail) 

while(abs(v)<large) 

{ 

vu = ((2+(0.25-e/r)*del*del)*v + ((k+l)*del/r-l)*vl)/(l+(k+l)*del/r); 
r += del; 
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vl = v; v = vu; 
} 

return(v); 

} 



Appendix B 


Uncertainties and wavepackets 


In chapter 4, it was seen that both position and momentum cannot be measured precisely 
at the same time. It can also be seen that this would be true for any two noncommuting 
observables. A more precise statement of this fact will be made now in the form of the 
well-known Heisenberg uncertainty principle. First, we need to define the uncertainty in 
the measurement of an observable. The root-mean-squared error is a good measure of 
uncertainty. For the position operator X, this would be 

Ax = j{{X - {X) a )*) a , (B.l) 

where the expectation values are for some given state |s) as defined in equation 4.18. For 
simplicity, a Kronecker delta normalization is assumed for this state: (s|s) = 1. Similarly, 
for the momentum operator P, the measure of uncertainty would be 


A p = y<(P-(P> s ) 2 > s . (B.2) 

Now it can be shown that the product of these uncertainties has a minimum possible value. 
In doing so, we consider the square of the uncertainty product: 



(Ax) 2 (A,pf = (s 4 2 s)(s B 2 s), 

(B.3) 

where 


A = X-(X) S , B = P-(P) S . 

(B.4) 

As A is hermitian 


(A|s»t = < s |At = (s\A. 

(B.5) 

A similar relation is 

true for B as well. Hence, one may define 



Cq 

II 

jh 

II 

(B.6) 
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such that equation B.3 can be written as 

(A;r) 2 (A p) 2 = (u[u)(v\v) > |(«|u)| 2 . 
The above inequality can be proved as follows. 


i \ (wilt), 2 
0 < |n) — -V-l—e-lv 


(B.7) 


= ( u\u) — 


(v\v) 

K«HI 2 


( v \v) 


(B.8) 


Hence the result. Now, using the definition of the commutator bracket, equations B.6 
and B.7 give 


(A.r) 2 (Ap) 2 > 


<*l 


-[A,B] + -(AB + BA) 


\s) 


= ±\(s\[A,B]\s)\ 2 + ±\(s\(AB + BA)\s)\ 2 . 


As the commutator [. X , P} = ih, it is seen that 

[A, B] = ih. 


(B.9) 


(B.10) 


Hence, from equation B.9, we get 

(Ax) 2 (Ap) 2 > h 2 / 4, (B.ll) 

where the term involving {AB + BA) is dropped as it is seen to be non-negative and does 
not change the inequality. This gives the celebrated Heisenberg uncertainty relation to be 


(Ax) (A p) >h/2. 


(B.12) 


This result has been derived for a position operator and the corresponding momen¬ 
tum operator. But it must be true for any two operators with the same commutator. It 
is clearly true for position and momentum in each of the three spatial dimensions. In the 
above derivation, the only place where the commutator relation is actually used is equa¬ 
tion B.9. Hence, an uncertainty relation can be found for any pair of arbitrary operators 
by inserting the appropriate commutator in equation B.9. If the commutator is zero, the 
minimum uncertainty is zero and hence, the corresponding observables can be measured 
simultaneously with indefinite accuracy. 

To get a feel for the minimum uncertainty product, we shall find a one-dimensional 
single particle state which has the minimum possible uncertainty product of position and 
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momentum as allowed by equation B.12. To obtain the minimum, we must find the condi¬ 
tions for the equality options in the inequalities of equations B.7 and B.ll. These conditions 
are quickly seen to be: 


A|s)=AB|s), (B.13) 

(s\(AB + BA)\s) = 0, (B.14) 


where A is a constant yet to be determined. To find |s) in its position representation, we 
write equation B.13 in its position representation (see chapter 3). It reduces to the following 
first order differential equation (using the definitions in equation B.4). 


dip 

dx 


i(x - .x 0 ) ip 0 

\h + h 


*P, 


(B.15) 


where ip is the position representation of |s), x$ = (X) s and po = (P) s - This differential 
equation has the solution: 


ip 


N exp 


i(x — x o) 2 ipox 


2A h 


+ 


n 


(B.16) 


where N, the integration constant, is the normalization constant. Hence, it can be deter¬ 
mined by the following normalization condition. 

1 = (s|s) = J \ip\ 2 dx. (B.17) 

To determine A, we eliminate A in equation B.14 by using equation B.13 and its conjugate: 


(s|A = \*(s\B. 


(B.18) 


This gives 


{\ + \*){s\B 2 \s) = 0. 


(B.19) 


As (s|B 2 |s) = (u|u) is the norm of a nonzero ket, it must be nonzero. Hence, to satisfy 
equation B.19, A must be purely imaginary. A must also be negative imaginary to prevent 
ip from going to infinity at infinity (see equation B.16). We can also relate A and N to the 
position uncertainty Ax by using the following condition on ip which really is the definition 
given in equation B.l written in the position representation. 


J (x — x 0 ) 2 \ip\ 2 dx = (Ax) 2 . 


(B.20) 


Now, the conditions in equation B.17 and B.20 will give 


(x - x 0 ) 2 
4(Ax) 2 


ip 0 x 


ip = [27t(Ax) 2 ] 1 ^ 4 exp 


n 


(B.21) 
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It can be seen that the ground state of the harmonic oscillator is exactly this state. As 
it is an eigenstate of the hamiltonian, it does not change with time. The same minimum 
uncertainty state is also possible for the free particle. However, for the free particle, it is 
not an eigenstate of the hamiltonian and hence, it does change with time. The change, 
with time, of the free particle minimum uncertainty state can be seen to be similar to that 
of the position eigenstates as discussed in chapter 4. The wavefunction spreads with time 
and does not remain a minimum uncertainty state. 

The minimum uncertainty state is sometimes called the minimum uncertainty wave- 
packet. The term “wavepacket” is loosely used for any wavefunction that is localized in a 
small region of space. The minimum uncertainty wavepacket can be visualized as a gaussian 
(bell shaped) wavefunction that has a forward motion in time with momentum jjq. 
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